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Lecture Course 491 — Advanced Quantum Theory 
F. J. Dyson — Fall Semester 1951 



Introduction 



Books 

W. Pauli, "Die Allgemeinen Principien der Wellenmechanik" ; Handbuch der Physik, 2 ed., Vol. 24, 
Part 1; Edwards reprint, Ann Arbor 1947. (In German) [1] 

W. Heitler, Quantum Theory of Radiation, 2nd Edition, Oxford. 3rd edition just published. [2] 

G. Wentzel, Introduction to the Quantum Theory of Wave-Fields, Interscience, N.Y. 1949 [3] 

I shall not expect you to have read any of those, but I shall refer to them as we go along. The later part 
of the course will be new stuff, taken from papers of Feynman and Schwinger mainly. [4] , [5] , [6] , [7] , [8] 

Subject Matter 

You have had a complete course in non-relativistic quantum theory. I assume this known. All the general 

principles of the NR theory are valid and true under all circumstances, in particular also when the system 
happens to be relativistic. What you have learned is therefore still good. 

You have had a course in classical mechanics and electrodynamics including special relativity. You know 
what is meant by a system being relativistic; the equations of motion are formally invariant under Lorentz 
transformations. General relativity we shall not touch. 

This course will be concerned with the development of a Lorentz-invariant Quantum theory. That is not 
a general dynamical method like the NR quantum theory, applicable to all systems. We cannot yet devise 
a general method of that kind, and it is probably impossible. Instead we have to find out what are the 
possible systems, the particular equations of motion, which can be handled by the NR quantum dynamics 
and which are at the same time Lorentz-invariant. 

In the NR theory it was found that almost any classical system could be handled, i.e. quantized. Now 
on the contrary we find there are very few possibilities for a relativistic quantized system. This is a most 
important fact. It means that starting only from the principles of relativity and quantization, it is mathe- 
matically possible only for very special types of objects to exist. So one can predict mathematically some 
important things about the real world. The most striking examples of this are: 

(i) Dirac from a study of the electron predicted the positron, which was later discovered [9]. 

(ii) Yukawa from a study of nuclear forces predicted the meson, which was later discovered [10]. 

These two examples are special cases of the general principle, which is the basic success of the relativistic 
quantum theory, that A Relativistic Quantum Theory of a Finite Number of Particles is Impossible. A RQ 
theory necessarily contains those features: an indefinite number of particles of one or more types, particles 
of each typo being identical and indistinguishable from each other, possibility of creation and annihilation 
of particles. Thus the two principles of relativity and quantum theory when combined lead to a world built 
up out of various types of elementary particles, and so make us fool quite confident that we are on the right 
way to an understanding of the real world. In addition, various detailed properties of the observed particles 
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are necessary consequences of the general theory. These are for example; 

(i) Magnetic moment of Electron (Dirac) [9] . 

(ii) Relation between spin and statistics (Pauli) [11]. 

Detailed Program 

We shall not develop straightaway a correct theory including many particles. Instead we follow the historical 
development. We try to make a rclativistic quantum theory of one particle, find out how far wc can go 
and where we get into trouble. Then we shall see how to change the theory and get over the trouble by 
introducing many particles. Incidentally, the one-particle theories are quite useful, being correct to a good 
approximation in many situations where creation of new particles does not occur, and where something 
better than a NR approximation is needed. An example is the Dirac theory of the H atom.^ 

The NR theory gave levels correctly but no fine-structure. (Accuracy of one part in 10,000). The Dirac 
one-particle theory gives all the main features of the fine-structure correctly, number of components and 
separations good to 10% but not better. (Accuracy one part in 100,000). 

The Dirac many-particle theory gives the fine-structure separations (Lamb experiment) correctly to 
about one part in 10,000. (Overall accuracy 1 in 10*.) 

Probably to get accuracy better than 1 in 10* even the DMP theory is not enough and one will need 
to take all kinds of meson effects into account which are not yet treated properly. Experiments are so far 
only good to about 1 in 10*. 

In this course I will go through the one-particle theories first in detail. Then I will talk about their 
breaking down. At that point I will make a fresh start and discuss how one can make a relativistic quantum 
theory in general, using the new methods of Feynman and Schwinger. From this we shall be led to the 
many-particle theories. I will talk about the general features of these theories. Then I will take the special 
example of quantum electrodynamics and get as far as I can with it before the end of the course. 

One-Particle Theories 

Take the simplest case, one particle with no forces. Then the NR wave-mechanics tells you to take the 

equation E = — of classical mechanics, and write 
2m 

E ih^^ ^ -z/.^ (1) 



to get the wave-equation^ 



-.9 , f 92 ^2 X f^2 



dt 2m \dx^ dy-^ dz^ J 2m 



satisfied by the wave-function tp. 

To give a physical meaning to ^p, wc state that p = tl)*il) is the probability of finding the particle at the 
point X y z at time t. And the probability is conserved because^ 

|.V.J=0 (3, 



where 



j = ^(rvv-vvv*) (4) 



where ip* is the complex conjugate of ip. 

Now do this relativistically. We have classically 

E^ = m^c^ + cV (5) 
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which gives the wave equation 



T2a^V' = V2V'-^V (6) 



c 



This is an historic equation, the Klein-Gordon equation. Schrodinger already in 1926 tried to make a RQ 
theory out of it. But he failed, and many other people too, until Pauli and Weisskopf gave the many-particle 
theory in 1934 [12]. Why? 

Because in order to interpret the wave-function as a probability we must have a continuity equation. 
This can only be got out of the wave-equation if we take j as before, and 

ih f i^dip dtp* 



But now since the equation is 2"^^ order, t/j and are arbitrary. Hence p need not be positive. We have 

Negative Probabilities. This defeated all attempts to make a sensible one-particle theory. 

The theory can be carried through quite easily, if we make tjj describe an assembly of particles of 
both positive and negative charge, and p is the net charge density at any point. This is what Pauli and 
Weisskopf did, and the theory you get is correct for 7r-mesons, the mesons which are made in the synchrotron 
downstairs. I will talk about it later. 



The Dirac Theory 



The Form of the Dirac Equation 

Historically before the RQ theory came the one-particle theory of Dirac. This was so successful in dealing 
with the electron, that it was for many years the only respectable RQ theory in existence. And its difficulties 
are a lot less immediate than the difficulties of the one-particle KG theory. 

Dirac said, suppose the particle can exist in several distinct states with the same momentum (different 
orientations of spin.) Then the wave-function ip satisfying (6) must have several components; it is not a 
scalar but a set of numbers each giving the prob. amplitude to find the particle at a given place and in a 
given substate. So we write for tp a column matrix 



1p2 



for the components ipal a = 1,2, . 



Dirac assumed that the probability density at any point is still given by 

a 

which we write 



(8) 



as in the NR theory. Here V'* is a row matrix 



[ri,r2, •••] 

We must have (3) still satisfied. So ip must satisfy a wave-equation of First Order in t. But since the 
equations arc rclativistic, the equation has to be also of 1^* order in x y z. Thus the most general possible 
wave-equation is 



1 d-ip k dtp ."ic ^ , 



dxk h 



(9) 



where xi X2 X3 are written for x y z and o? c? /3 are square matrices whose elements are numbers. The 
conjugate of (9) gives 



\ d^}* MT^dip* .mc ^ 



(10) 



where of'* and (3* are Hermitian conjugates. 

Now to get (3) out of (8), (9) and (10) we must have a'^* = a'', (3* — (3 so and (3 are Hermitian; and 



jk = c {tp* a^tp) 



(11) 
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Next what more do we want from equation (9)? Two things. (A) it must be consistent with the 2"'^ 
order equation (6) we started from; (B) the whole theory must be Lorentz invariant. 

First consider (A). If (9) is consistent with (6) it must be possible to get exactly (6) by multiplying (9) 

by the operator 



1 d f d . I lie „ 



(12) 



d d d 

chosen so that the terms with mixed derivatives — -- — and — cancel. This gives 

Ot OXk ot 



_JPjj_ 

dxk dxi 



This agrees with (6) if and only if 



a'=/3 + /3a'= =0 



(13) 



k2 



0^ = I, (identity matrix) 



Thus we could not possibly factorize the 2""^ order equation into two first-order operators involving ordinary 
numbers. But we can do it with matrices. 



Consider the Pauli spin matrices 



CTl 



1 

1 



0-2 



-i 

1 



(^3 



you are familiar with. They satisfy 



But we cannot make 4 matrices of this type all anti-commuting. They must be at least 4x4. 



One possible set of a*^ and P is 



In particular 



(Jk 
ak 



/o 








1^ 


/ 








1 










1 








V 








0^ 


V 



i 
-i 
/ 



a 








1 


^\ 











-1 


1 











Vo 


-1 








(14) 



(15) 



These are hermitian as required. Of course if a'' and /? are any set satisfying (13) then Sa''S~^ and 
S(3S~^ arc another set, where S is any unitary matrix SS* = 1. And conversely it can be proved that 
every possible 4x4 matrices a'^ and /? are of this form with some such matrix S. We do not prove this here. 



The Dirac equation is thus a set of 4 simultaneous linear partial differential^ equations in the four 
functions ipa- 
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Lorentz Invariance of the Dirac Equation 

What does this mean? Consider a general Lorentz transformation: If x'^ are the new coordinates: 

3 

x'^ = ^ an„Xi, {xo = ct) (16) 

In the new coordinate system the wave-function will be if}' . Clearly we do not expect that ip' = tp. Example: 
in the Maxwell theory which is rclativistic, the magnetic field H is no longer a pure magnetic field in a 
moving system. Instead it transforms like a tensor. So we have to find some transformation law for the i/} 
which will leave invariant the physical consequences of the equations. 

Wc need in fact two things: (i) the interpretation of tjj*ip as a probability density must be preserved, 
(ii) the validity of the Dirac equation must be preserved in the new system. 

First consider (i). The quantity which can be directly observed and must be invariant is the quantity 

(tP*^) X V 

where F is a volume. Now in going to a new Lorentz system with relative velocity v the volume V changes 
by Fitzgerald contraction to the value 



V = V\ll 
Therefore 

(*•>') = ^S== (17) 

and so (V'*V') = P transforms like an energy, i.e. like the fourth component of a vector. This shows incidentally 
that ip' ^ tl). Since p and j are related by the equation of continuity, the space-components of the 4- vector 
are 

(5i,52,53) = V*aV = -Jfc (18) 
c 

So we require that the 4 quantities 

{Si, 52, Ss, So) = r^) (19) 

transform like a 4-vector. This will be enough to preserve the interpretation of the theory. 

Assume that 

ip' = S^p (20) 

where 5 is a linear operator. Then 

= il,*S* (21) 

So we require 



V'*'a'=V' = i^*S*a^Sil) = ^ au^i^^a^i) (22) 

i/=0 

3 

i/=0 

writing =1. 
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Thus we need 

3 

S'*a''S' = ^a^i.a", /x = 0,1,2,3 (23) 
Next consider (ii). The Dirac equation for ip' is 

Now the original Dirac equation for ip expressed in terms of the new coordinates is 

.3 3 

E E a^"-^^^" V' + V = (25) 

The sets of equations (24) and (25) have to be equivalent, not identical. Thus (25) must be the same as 
(24) multiphed by pS~'^/3. The condition for this is 

3 

l3S-^l3a" = a^a^xS-^^ (26) 



But (23) and (26) are identical if 

138-^13 = S* which means S* pS = P (27) 
Thus /? transforms like a scalar, a" like a 4-vector when multiplied by 5**5'. 



To find the S 

Given two coordinate transformations in succession, with matrices ahcady found, the combined transfor- 
mation will correspond to the product of these matrices. Hence we have to consider only 3 simple types of 
transformation. 



1) Pure rotations 

2^0 — ^'z— 2^3 

x\ = x\ COS 6 + X2 sin 6 
x'2 = —xi sin 6 + X2 cos 6 



2) Pure Lorentz transformations 

x'l = Xi x'2 = X2 

x'^ = X3 cosh + xq sinh 6 
x'q = X3 sinh 6 + xo cosh 6 



3) Pure reflections 

x[ = —Xi x'2 = —X2 x'^ = —X3 x'q = Xo 



Case 1 Then 



cos56' + ifT3sini6i (28) 
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Here 



commutes with as and /3. 



Then 



(^3 





(73 



as required. 



S* = cos ^6 — ia^ sin ^9 

S*pS = (3 
S*a°S = a° 
S*a^S = 



S*a^S = cos 6* + sin o? 
5* a^S = - sin 61 cos 6»a2 



Case 2 
Here 



5 = 5* = coshi6' + a3sinhi0 (29) 

S*I3S = p 
S*a^ S ~ 
S*a^S = 

S*a^S = cosh 6' + sinh^ a° 
S*a°S = sinh6i + cosO a° 



Case 3 



S = S*=p (30) 
Note that in aU cases S is ambiguous by a factor ±1. So in case 1 a rotation though 360° gives S = —1. 



Problem 1 Find the S corresponding to a general infinitesimal coordinate transformation. Compare and 
show that it agrees with the exact solutions given here. 



The ipaS transforming with these S'-transformations are called spinors. They are a direct extension of the 
non-relativistic 2-component spin-functions. Mathematical theory of spinors is not very useful. In fact we 
find always in practice, calculations can be done most easily if one avoids any explicit representation of the 
spinors. Use only formal algebra and commutation relations of the matrices. 

The covariant notation 

In order to avoid distinction between covariant and contravariant vectors (which we have also unjustifiably 
ignored in the previous discussion) it is useful to use the imaginary 4*^ coordinate 

X4 = ixo = ict (31) 

In this coordinate system the four matrices^ 

71,2,3,4 = H/3a'' i.e. (32) 
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= J' ^' = [_'i I J' ^^ = U J' ^'^Vo -0^ -V 

are a 4- vector. They are all Hermitian and satisfy 

7/i7i/ + 7i/7/i = ^^i^u (33) 
The Dirac equation and its conjugate may now be written 

E'^ dip mc 



4 o-r 



with 



^7.--V = (34) 



= and (35) 



= « (V'7/^ V') = (^^ 3,ip^ 



These notations are much the most convenient for calculations. 



But 

while 

So 



Thus 



(36) 



ih^ = HiP (37) 



Conservation Laws. Existence of Spin. 
The Hamiltonian in this theory is^ 

dt 

3 g 

H = -ihc^ a'' h = -ific a • V + (38) 

^ OXk 

This commutes with the momentum p = —ifiV. So the momentum p is a constant of motion. 
However the angular momentum operator 

L = r X p = -ihr x V (39) 

is not a constant. For 



[H,L]=-h^ccxxV (40) 
[H, cr] = —ihc V • [a, cr] where cr = (ai, (J2, cts) 
[a^,cri] =0, [a^,(T2] =2ia^, [a^jcrs] = —2io?', etc. 

[H, 0-3] = 2hc (a^ V2 - a^Vi) and thus 

[H,a]=2ncaxV (41) 
L + IhxT = hJ (42) 
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is a constant, the total angular momentum, because by (40), (41) and (42) 

[iJ,J] =0 

L is the orbital a. m. and ^hcr the spin a. m. This agrees with the N. R. theory. But in that theory the 
spin and L of a free particle were separately constant. This is no longer the case. 

When a central force potential V{r) is added to H, the operator J still is constant. 



Elementary Solutions 

For a particle with a particular momentum p and energy E, the wave function will be 



tp{x, t) = u exp ( i 



p-x .Et\ 



h h J 

where m is a constant spinor. The Dirac equation then becomes an equation for u only 

Eu = (c a • p + m(?0) u 

We write now 

p+ = pi + ip2 p- =pi- ip2 

Then (44) written out in full becomes 

(E — mc^) ui = c {P3U3 + P-U4) 



(43) 

(44) 
(45) 



(E - mc^) U2=c {p+U3 - P3U4) 
(E + mc?) us = c {p3Ui + P-U2) 



(46) 



[E + rac^) U4 = c {p+ui - P3U2) 

These 4 equations determine M3 and U4 given m and U2, or vice- versa. And either ui and U2, or U3 and U4, 
can be chosen arbitrarily provided that^ 



E' = TO^c^ + c^p^ 



(47) 



Thus given p and E = -\-\/m^d^ + c^p^, there are two independent solutions of (46); these are, in non- 
normalized form: 

/ 1 \ / \ 





cp-i 



E + mc^ 
cp+ 

\E + mc^ I 



1 

cp- 



E + mc^ 
-CPs 
\E + mc'^/ 



(48) 



This gives the two spin-states of an electron with given momentum, as required physically. 



But there are also solutions with E = —^rr?c^ -\- (?p^. In fact again two independent solutions, making 
4 altogether. These are the famous negative energy states. Why cannot we simply agree to ignore these 
states, say they are physically absent? Because when fields are present the theory gives transitions from 
positive to negative states. e.g.H atom should decay to negative state in 10~^° sees, or less. 

Certainly negative energy particles are not allowed physically. They can for example never be stopped 
by matter at rest, with every collision they move faster and faster. So Dirac was driven to 
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The Hole Theory 

All negative-energy states are normally filled by one electron each. Because of the exclusion principle 
transitions of ordinary electrons to these states are forbidden. If sometimes a negative energy state of 
momentum —p energy —E is empty, this appears as a particle of momentum p energy +E, and the opposite 
charge to an electron, i.e. an ordinary positron. 

Thus we are led at once to a many-particle theory in order to get sensible results. With spin-0 particles, 
to get positive probabilities. With spin-i particles, to get positive energies. 

The Dirac theory in its one-particle form cannot describe properly the interaction between several 
particles. But so long as we are talking only about free particles, we can describe them with one-particle 
wave- functions. 



Positron States 

So which wave- function will describe a positron with momentum p and energy E7 Clearly the wave function 
should be of the form 

(p • X Et\ 
— ^^^^ 

as always in quantum mechanics. But the negative-energy electron whose absence is the positron has a 
wave-function 

p-x .Et\ 



ip{x,t) = uexp ^—i^^ — bi—j (50) 

since it has a momentum —p energy —E. 
Thus we must take 

<^ = CV'+, i.e. v = Cu+ (51) 
where ip~^ is tjj with complex conjugate elements but not transposed, and C is a suitable constant matrix; 



ip'^{x, t) = exp 



p-x .Et\ 



We know that m is a solution of 

Eu= (ca-p- mc^0) u (52) 

We want the theory to make no distinction between electrons and positrons, and so v must also satisfy the 
Dirac equation 

Ev = (c a • p + m<?0) v 
ECu+ ={ca-p + mc^0) Cu+ (53) 

But from (52) we have for u+ the equation 

Eu+ ={ca+-p- mc^l3+) u+ (54) 

In order that (53) and (54) be identical we should have 

Ca'=+ = a''C, C(3+ = -(3C (55) 

Now in fact 

Therefore a suitable C will be 

C = -il5a'' = ^2=( 0° J °o") (56) 
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The relation between ijj and is symmetrical because 

C^ = I Hence = C(j)+ (57) 

The <p is called the charge-conjugate wave- function corresponding to the negative-energy electron tjj. Clearly 

(t)*(t) = (CV+)* (CV'+) = ip^C*Cip+ = ip* {C*Cf ip = ip*i) (58) 

And 

(j)*a''(j) = ip'^C*a''Cip+ = 7p*Ca'''^Cij = i}*a'''^ (59) 

Thus the probability and flow densities arc the same for a positron as for the conjugate negative electron. 

For many purposes it is easier to represent positrons directly by the ip wave-function, e.g. in computing 
cross-sections for pair creation and so forth as we shall do later. But if you actually want to see the positron, 
e.g. in describing the details of a positronium experiment, it is necessary to use the <j) wave-function to 
represent e.g. the way the spin is pointing. 

This is all we shall say about free electrons and positrons. 

Electromagnetic Properties of the Electron 

Given an external (c-number) electromagnetic field defined by the potentials 

/i= 1,2,3,4 Ai = i^ 

given functions of space and time. Then the motion of a particle in the field is found by substituting in the 
free-particle Lagrangian 

E + e $ ior E 

P+-A for p (60) 
c 

where (— e) is the electron charge. We write the momentum-energy 4-vector 

P = {Pi, P2, P3, P4 = iE/c) (61) 

Then we have to substitute simply 

Pii + - for (62) 

Now in the quantum theory 

p, - -^h^^ (63) 

Therefore the Dirac equation with fields is 



In the non-covariant notations this is 

3 



-e i> + ^ I -ihc + eAk j a'' + mc^P 



1 



tp (66) 
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since by (57), we have ip^n = "4^*13^^ = (C(/)+) (3^^ = cjy^C'^ Pj^; the wave function (j) = Ctp'^oi a positron 
satisfies by (65) 

Multiplying by C/3 this gives 

This is exactly the Dirac equation for a particle of positive charge (+e). We have used 

C/?7j/3C = -7^, (69) 

which follows from (15), (32), and (55). 

The Hydrogen Atom 

This is the one problem which it is possible to treat very accurately using the one-electron Dirac theory. 
The problem is to find the eigenstates of the equation 

Eip = Hip 

H = -ihc a • V + mc^/J (70) 

r 

As in the NR theory, we have as quantum numbers in addition to E itself the quantities 

jz = -i[rx V]3 + iaa (71) 

.7(.7 + l) = J'= H(rx V) + i^]' (72) 

where jz and j are now half-odd integers by the ordinary theory of angular momenta. These quantum 
numbers are not enough to fix the state, because each value of j may correspond to two NR states with 
£ = i ± |- Therefore wc need an additional operator which commutes with H, which will distinguish 
between states with a parallel or antiparallel to J. The obvious choice is 

Q = cr-J 

But [H, a] is non-zero and rather complicated. So it is better to try 

Q = f3a-J (73) 

which is the same in the NR limit. 
Then we have 

[H, Q] = [H, pa-J] = [H, f3a]-J + (3cr- [H, J] 

But [H, J] = 0; furthermore, since 

a'^Pai = Pa^a'' k ^ i and a'^Pak = -Pa^a'^ 

we get 

3 

[H,pcr] = -ihc {{a-V)pa - per (a- V)} = -2ihcY^a^UkpVk 

fe=i 

Therefore 

3 

[H,pa]-J = -2hc^a^akpWk{r x W)k - ihc{oL -W) p = -ihc{oL -V) p = [H,\P] 

k=l 
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V • r X V = and a'^ak = ( *? j.] for all fc's. 



because 

Hence the quantity which commutes with H and is a constant of the motion is 

K = ptT-J-^P (74) 
There must be a relation between K and J. In fact 



h ) h 

Hence 

^^ = J^ + i = (i + i)' (75) 
Therefore K has integer eigenvalues not zero, 

K = k = ±{3 + \) (76) 
j = |fc|-i, A: = ±l, ±2, ±3, ... (77) 

Using the eigenvalue for K, we can simplify the Hamiltonian, which we could not do as in the NR theory 
with the eigenvalue of L"^ alone. First 

(7 • r <T • (r X V) = icr • (r X (r X V)) = i (cr • r) (r • V) - ir^a • V (78) 

Let now 

e = -ia^a^a^ uk = eot (79) 

Then multiplying (78) by we get: 

( 9 

io: • V = a • r cr • (r X V) — i a • r ( r— 

' or 



Let ar = - cx • r, then by (39) and (42) 



1 9 1 d 

-ioL-V = —ar {id • J — |i) — iar-r- = - (*/3i^ ~ ~ ictr^r 
r ' or r or 



Thus finally we can write (70) in the form 



= h i?ic q, [ ■ — (80) 



This gives the Dirac equation as an equation in the single variable r, having separated all angular variables. 
For the solution of this equation, see - Dirac, Quantum Mechanics, Third Edition, Sec. 72, pp 268-271. 

Solution of Radial Equation 

We may choose a two-component representation in which 
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Then 



(E - mc^) u = u + hc{ 1- 

\ / J. \ r or J 

(E + mc^) v = v + hc\ —] '. 

\ I J. \ r or J 



(82) 



Let now 

-E + mc? E + m(? 



ai = ; a2 = r ; " = t- , (83) 

nc nc nc 



the fine structure constant. Then 



/ a\ _ f-l + K _ d 
V r) \ r dr 



(84) 



Next put a = y^oioi = Vm'^c'^ — E'^/hc which is the magnitude of the imaginary momentum of a free 
electron of energy E. Then if) ^ e""'" at infinity. Hence we write 

p-ar 

u = (85) 



r 

—a 

V = g 



So 



dr ^ r , 



(7--)/ 



(86) 



Now we try solutions in series 
This gives 



f = Y,Csr\ 9 = Y.dsr^ (87) 



Putting 



acs- aiCs-i = -ads-i + {s + k) dg 
ads + a2ds-i = +acs-i + {-s + k) Cg 



Cs = aiCg-i — ads-i we have Cs = a Cg — (s + k) ds = — (a dg + (s — k) Cs) 

a 

aia + a{s + k) ^ aa — ai{s — k) 



(88) 



(af — a^) a + 2saai 

^s+i — 2 I r"2 PIT 
aia^ + ai (s'^ — fc^) 
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Suppose the series do not terminate. Then for large s 



hence / « exp(2ar) 



Cg Cg S 

This is permissible when a is imaginary. Thus there is a continuum of states with 

E > mc^ (89) 

For real a the series must terminate at both ends in order not to blow up at infinity. Suppose then^ Cg is 
non-zero for 

s = e + l, e + 2, ... e + n n>l (90) 

and otherwise zero. This gives 

a^ + e^-k^ =0 

(a^ — a^) a + 2 (e + n) aai = 

Now not both and are zero, thus the wave function r^^+'^ must be integrable at zero. This gives 
e > -5. But e = ±VA;2 - a^. Now P > 1, hence VW^^c? > 5, and 

e = Wk'^ - o? (91) 

Also 9 



((mc2-S)'-(mV-S2)y 



2 A.E'^o? 



4 (m2c4 - E-^) {mc^ - sf j 4 (m^c* - £;2) 



^2 



1 + 



Hence in this case 

E=- 

J1 + 



E= , (92) 



{n + y/k'^-a^y 



Given this positive E, (a^ — a^) is negative (see (83) and (84)) and so it is allowable to square (e + n) to 
find these solutions, without introducing any difficulties. So for each 

fc = ±l,±2,±3,... (93) 
n = 1,2,3, ... 

solutions exist, with E given by (92). 

The alternative possibility is that all eg are zero. Suppose not both of and are zero. Then 
+ — A;2 = as before and so e = \fW^^c?. But now 

fliCc — arfe = 
a Ce — (e + k,)di^ = 



Hence aa — ai{e + k) =0 and k must be positive to make e + k = \/k^ — oi} + > 0. After this the solution 
goes as before. So solutions (92) exist for 

n = 0, fc = +l,+2,+3, ... (94) 
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The principal quantum number N is 
Expanding in powers of a 



E = mc^ 



N = n+\k\ 



1 



2]V2 + ]V3 \8N 2\k\ 



(95) 



NR levels fine structure 

There is exact degeneracy between the two states of a given |A;|. Nonrelativistic states are given by 

j=£-^^k = +£ 



So 



'Pl/2 


is 


J = 


1 

2' 


k = l 




is 


j = 


1 

2' 


k = -l 


^"^3/2 


is 


j = 


1 

2' 


k = -2 



degenerate 



Behaviour of an Electron in a Non-Relativistic Approximaf.ion 

( d e 

Multiplying the Dirac equation (64) by ^^^v I ^ h 

\ nc 



we have 



dxu he 



2 2 

m c 



fi2 



-^- = 



Using 7^ = 1, ^^t'^v + lulu = this gives 

E 

Here 



he 



9 ? 

m c 



Thus Fi2 = i?3 magnetic field component 

Fi4 = iT^ + -^ = -tEi 

oxi e at 

(Ji4 = iai 



^ _ dA, OA^ 

dXa dXv 



electric field 

spin component 
velocity component 



Thus (97) becomes 



E 



(96) 



(97) 



(98) 



This is still exact. 
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Now in the NR approximation 



ih— = mc^ + 0(1) 



dt 



—tn— — e4> — mc —in— — e<P + mc 



fi2c2 {\ dt J \ dt 



Hence 



The NR approximation means dropping the terms O (l/mc^). Thus the NR Schrodinger equation is 

,n^^U-e^-^±(l-_+!fAX + ^J..H-i<..E)\* (99) 



dt 1 2m ^ V 9a;fc /ic / 2mc 

V fc=l y 

The term ex- E is really relativistic, and should be dropped or treated more exactly. Then we have exactly 
the equation of motion of a NR particle with a spin magnetic moment equal to 

pti 

M = --P-a (100) 
2mc 

This is one of the greatest triumphs of Dirac, that he got this magnetic moment right out of his general 
assumptions without any arbitrariness. 

It is confirmed by measurements to about one part in 1000. Note that the most recent experiments 
show a definite discrepancy, and agree with the value 

M = -^all + ^\ (101) 
2mc \ 2TThc J ^ ' 

calculated by Schwinger using the complete many-particle theory. 



Problem 2 Calculate energy values and wave functions of a Dirac particle moving in a homogeneous infinite 

magnetic field. Can be done exactly. See F. Sauter, Zeitschrift fiir Physik 69 (1931) 742. 

Solution 

Take the field B in the z direction. 

Ai = -\By , A2 = \Bx 

The second-order Dirac equation (98) gives for a stationary state of energy ±i? 

£;2 m^c^ \ f d lieB V f d lieB V d^ eB 



fi2(;2 fj^ J \dx 2 he J \dy 2 he J dz^ he 

Taking a representation with diagonal, this splits at once into two states with = ±1. Also 

r -J ^ ^ 
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d 

is a constant of the motion, say = ih where £ is an integer. And —ih— = p^. Let A = leBhcl. Then 

oz 



This is an eigenvalue problem with eigenvalues of a two-dimensional harmonic oscillator. 
Thus 

= m^c^ + c^vl + \{n±{£,± 1)} 

where 4 = 0, ±1, ±2, . . . , ±(n - 1). 
So the eigenvalues are 

E = ^/rrfi^T^^^+M\eBhc\ with M = 0, 1, 2, . . . 

The lowest state has energy exactly mc^. 
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a'' a 



Summary of Matrices in the Dirac Theory in our notation 

(7fc)* = Ik 



25ul a" (3 + /3a'= = 
7fe = -il3Q^ = ip-yk 74 = /? lij-lu + >7/i 

a'^'ye - na'' = 2i5ek(3 75 = 7i 727374 7/:*75 + 757^ = 



25^.1 



We use the following representation: 



/3 



Ik = 



-lOk 



lUk 





\ 






^0 


—I 






/I 


^ 


















= ( 







) 


0-3 






) 












(0 





1\ 








/o 













1 


o\ 












1 











i 







3. 








-1 




I.e. = 





1 














-i 





a = 


1 

















0) 








V 





0) 








-1 



























/o 





-1 


o\ 




.e. /3 = 74 







1 









/o 


-t 













-1 









- 


1 







^5 = I -I 






-1 

















vo 





-V 
















1 


0^ 






/o 





- 










/ 


- 


-A 






/O 


—i 


0\ 










—i 














1 

















i.e. 71 = 





i 








72 




1 








73 




i 


























[-1 











\i) -i 







a'' = eak 


V 


= ie/3 




e = 


— ia^ 




e^ = 




= I 


75 = -e 







i7fe = ??7fe 



akcre = aktti 



Ik = rjCTk 

1 

10 

\o 1 oy 

- 25ike 



-ir]p 



12 3 

r] = —a a a 



/O -i 
O -H\ _ 
il O J ~ i 




Jkcre 



k 



OikOt 

(^kle ■■ 



■- larr. 



Comparison with the Dirac notation: 



\0 i 

f o-Hk = ^SikV I3ak - (JkP = 
k,i,m= (1, 2, 3) cyclicly permuted 
ea" = 7jue + £7^ = CTfeC - eak = 

ikv - nik = (^kv - wk = (3v + = 

k,£,m = (1, 2, 3) cyclicly permuted 
Pi = e P2 = ri P3 = 0- 



-z 


0/ 



Latin indices: 1, 2, 3. Greek indices: 1, 2, 3, 4. 



THE DIRAC THEORY 



21 



Summary of Matrices in the Dirac Theory in the Feynman notation 

ak<T( + creak = 2(5^^1 



I 



Representation: 



75 = i7o7i7273 7^75 + 757/* = a'' 75 - 75"'' = 



7| = 



= -I 



a = 



<7k 



Ik 



<7k 



Pi 



CTl 



.1 a 



A 






1 Oyl 




0-2 




/O 





1^ 







1 







1 










V 





0^ 





0-3 



1 
-1 



/o 










0/ 



a = 



/o 



1 



o\ 
-1 



0/ 



i.e. ^ = 70 



I.e. 71 



/ 



-1 

V-1 



1\ 




1 



/O 1 



10 

\0 1 0/ 





1 


0/ 



75 



72 



/I 













1 














-1 













-1/ 


(' 








-^^ 








i 








—i 
















0/ 



P2 



O -il 
il O 



73 



/O 



1 
\0 i 







i ^ 


0/ 



1 





1 



-1 



0/ 









(Tfe = pia'' a'' = pKTfe p2 = pi = -ia^cy^o? p\ = p\=\ 

o-fe = -ip-rik 7fe = «P20-fc p\ = -ip2P P2 = -a^a^a^P 
ctui + cr^a'" = l^ikPx 7feO'« + o-<?7fe = -"^^ikP^ l3crk - (TkP = 
akCTi = ttkae = - jkje = icTm k,£,m= (1, 2, 3) cyclicly permuted 
a'' Pi - Piot' = 7mPi + Piln = cTkPi - Pio-fe = 

q'^P2 + P20L^ = lkP2 - P2lk = (^kP2 - p2(Tk = /9p2 + p2/3 = 

akcre = ittm \ 

(^kll = ilm \ k,£,m — (1, 2, 3) cyclicly permuted 
7fea^ = iPcTm J 

Latin indices: 1, 2, 3. Greek indices: 0, 1, 2, 3. 



Scattering Problems and Born Approximation 



General Discussion 

The problem of scattering of a Dirac particle by a potential can be treated exactly by finding the continuum 
solutions of the Dirac equation. This is a complicated business even for the simplest case of a Coulomb 
force. It was done by Mott, Proc. Roy. Sac. A135 (1932) 429. 

For most purposes in relativistic problems, and always when the scattering is produced by complicated 
effects involving radiation theory, one uses the Born approximation. That is, wc treat the scattering only 
to I''* order in the interaction, or only to some definite order in which we are interested. 

The formula for scattering from an initial state A to a final state B lying in a continuum of states is, 
transition probability per unit time, 

w = ^Pe\Vba\^ (102) 

This you ought to know. Pe = density of final states per unit energy interval. Vba is the matrix element 
of the potential V for the transition. Here V may be anything, and may be itself a second-order or higher 
order effect obtained by using higher-order perturbation theory. 

The difficulties in real calculations usually come from the factors 2 and tt and the correct normalization 
of states. Always I shall normalize the continuum states not in the usual way (one particle per unit volume) 
which is non- invariant, but instead 

o 

mc 

One particle per volume -— — (103) 

1^1 

where is the energy of the particles. Then under a Lorentz transformation the volume of a fixed region 
transforms like 1/\E\ and so the definition stays invariant. 

Thus a continuum state given by the spinor tjj = uex.p{{ip • x — iEt)/h} is normalized so that 

\E\ , , 

u*u = (104) 

Now if we multiply the Dirac equation for a free particle, (44), by u on the left, we get Eu*(3u = cu*f3a. • 
pu + mc^u*u] its complex conjugate is Eu*(3u = —cu*(ia • pu + mc^u*u since (3cx is anti-Hermitian; then 
by adding we get 

Euu = m(?u*u (105) 

Therefore the normalization becomes 

uu = +1 for electron states 
= — 1 for positron states 

With this normalization the density of states in momentum space is one per volume h'^ of phase space, that 
is to say 

1 mc^ 

per volume dp-idp2dps of momentum space, for each direction of spin and each sign of charge. Now we have 
again an invariant differential 

d^^d^ (108) 
\E\ 



This is the definition of e. (106) 
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Projection Operators 

Usually we are not interested in the spin either of an intermediate or of an initial or final state. Therefore 
we have to make sums over spin states which are of the form 

S = Y^ {sOu) {uPr) (109) 

2 

where O and P are some kind of operators, s and r some kind of spin states, and the sum is over the two 
spin states u of an electron of momentum p and energy E. 

Let us write 

f = '^Pi^Jn, Pi = iE/c (110) 

The Dirac equation satisfied by u is 

- imc) u = (111) 
The two spin states with momentum 4-vector —p satisfy 

(^ + imc)M = (112) 

As one can easily show from (48), these 4 states are all orthogonal in the sense that {u'u) = for each pair 
u'u. Therefore the identity operator may be written in the form 

I = ^ (uu) e (113) 

4 

summed over all 4 states with e defined as earlier. Hence by (111) and (112) we can write (109) as 

S = Y, (^0 ^^.^^ eu\ {uPr) = [sOK+Pr) (114) 
by virtue of (113); here the operator 

A+ = 115 

+ 2imc ^ ' 

is a projection operator for electrons of momentum p. 

In the same way for a sum over the two positron states u with momentum p energy E 

S = ^{sOu){uPr) = {sOK-Pr) (116) 

2 

with 

i) — imc 

A_ = ^ 117 

2imc ^ ' 

and we have 

A+ - A_ = I (118) 

These projection operators are covariant. In Heitler the business is done in a different way which makes 
them non-covariant and more difficult to handle. 

Note that here charge-conjugate wave functions are not used. The positrons of momentum p are repre- 
sented by the electron wave functions u of momentum —p energy —E. 



SCATTERING PROBLEMS AND BORN APPROXIMATION 



24 



Calculation of Spurs 

Suppose we have to calculate an expression such as 

^ ^ ^ {upOui) {uiOuf) 

I F 

summed over electron states only. This gives 

5 ^2 {ufOA+OA+uf) e 
summed over all four spin states up. To calculate this, let us consider the general expression 

c (uQu) 

u 

summed over all 4 spin states, where Q is any 4x4 matrix. 

Let Q have the eigenvectors ^1,^2, W3, with eigenvalues Ai, A2, A3, A4. Then 

4 

k=l 

and 

e (uQu) = ^ e ^ Afe (uwk) {w*u) = ^ XkW* ^ e (uu) > Wk 

U U k = \ fe = l V M ) 

Hence by (113) 

u 

Now X] A = sum of diagonal elements of Q = Spur Q. Thus^° 

^ e {uOu) = Sp Q 

and this is always easy to calculate. 



Problem 3: Given a steady potential V a function of position, and a beam of incident particles, electrons. 
Solve the Schrodinger equation in the Born approximation 

a) By stationary perturbation theory 

b) By time-dependent perturbation theory. 

Show that the results agree, with a transition probability per unit time given hy w = {2t^ /h)pE\VBA\'^ ■ 
Evaluate the cross section in the case V = —Ze^/r, averaging spin over initial state and summing over final 
state. 

c) Repeat the calculation with particles obeying the Klein-Gordon equation, leaving out the term, 
by either method. Compare the angular distribution in the two cases. 



Problem 4: A nucleus (O^^) has an even^^ j = ground state and an even j = excited state at 6 MeV. 
Calculate the total rate of emission of pairs, and the angular and momentum distributions. 
Solution: Let AE be the excitation energy, pN and j ^ the charge and current density operators of the 
nucleus. Then for the transition we are interested in pjv and jj^ are functions of position r with the 
time- variation of the single matrix element given by exp{—iAE/h}. Also 
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The electrostatic potential V of the nucleus has the matrix element given by 

VV = -A-kpn (120) 

The states being spherically symmetric, pjv is a function of r only, and so the general solution of Poisson's 
equation simplifies to^^ 

V{r)=-— f rlpN{ri)dr^ (121) 
^ Jo 

Outside the nucleus V{r) = Ze? jr is constant in time, and so the matrix element of V^(r) for this 
transition is zero. In fact from (119) and (120) we get by integration 

V{r) = ^(-47r)(-r)i;vo(r) = ^ijvo(r) (122) 

where Jno is the outward component of the current. 
The interaction which creates pairs is then 

-^jNo{r){-erW))dr (123) 

As an approximation consider the de Broglie wavelengths of all pairs long compared with the nuclear size. 
Then 

I = rm^ I rjNo{r)dT (124) 

The constant / rjNo{f)dT is not known exactly. Suppose as an order of magnitude estimate that the 
nucleus of charge Ze is uniformly spread over a sphere of radius Tq in the ground state, and also in the 
excited state. Since pN is roughly uniform inside the nucleus we have by integrating (119): 

JN = -i^rpN , and thus 

I = rm J r-Viv(r) dr = rm (^) Qe-'^'^'^'' (125) 

Q is roughly a measure of the charge-moment of inertia of the nucleus, and is equal to 



Thus 

I = {rm} e-'^^'/^ (126) 



So the problem is just to compute the probabilities of pair-emission with this interaction. Note that real 
radiation is strictly forbidden in a 0-0 transition, and so these pairs are actually observed in the reaction 

p + Fi9^0lQ + a^ Oi6 + e+ +a (127) 

Is it correct to take for the interaction just 

J V{r) {-etp*r dr 

taking the Coulomb potential of the nuclear charge and ignoring all electrodynamic effects? Yes. Because 
in general the interaction would be 

y" I (-e^» - ^ Afc (-e^*afc V) I dr (128) 
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where A^ are the scalar and vector potentials satisfying the Maxwell equations 

VV+-V- ^ = -A-KPN 
c at 

The matrix element of the interaction (128) is unchanged by any gauge transformation of the {A,ip). 
Therefore we may take the gauge in which 

V • A = 

Incidentally, since Lp = V{r)^ the second Maxwell equation reduces to 

2. 1 d'^A _ 47r . 

Now, since there is no free radiation present, also V x A = 0, and hence ^4 = 0, in this gauge, and therefore 
we can indeed ignore all electrodynamic effects. 

Let us calculate then the probability of pair emission with the interaction (126). A typical final state 
has an electron of momentum pi and a positron of momentum p2, with energies Ex, E^ and spins u\, U2 
respectively. For the creation of this pair the matrix element of I is 

I = -CuiPu2 , C = ^^-^ (128a) 
The density of final states is by (107) 

J_ 711 C 

p\dpi dcui p\dp2 du}2 (129) 



{2Trhf E1E2 



where dui and du2 are the solid angles for pi and p2. The creation probability per unit time is thus by 
(102) 

277 Pe 2 27r dpidp2 „2 1 m'^c'^ pfplduJi duJ2 s;^ r, ,2 .,on^ 
dp2 dp2 E2 



w = 



h dE' ' hd{Ei+E2) {2nhf E1E2 

Now fixing pi , 



d{Ei + E2) dE2 c^P2 



and 



Y: I U10U2 f = Y: («i/?"2) {u2Pu,) = Spur I P p I 

= _1 + Pi 'P2 ^ -^1^2 ^ E1E2 - m^c^ + c^piP2 cose 
where is the angle between the pair. Then writing in (130) 

dE\ = dpi , dui = 47r, du2 = ^nsinOdO 
El 

we obtain the differential probability^^ in Ei and 6 

Wo = 2r,^^iff '^^i (-^1^2 - m^c^ + c^PiP2 cos e) sin dO (131) 
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Since 

AE = 6MeV= 12 mc^ 
we can to a good approximation treat all particles as extreme relativistic. Thus 

472g4„4 

Wo = ° El El dEi (1 + COS e) sin 9 dO (132) 

So the pairs have an angular distribution concentrated in the same hemisphere, and predominantly equal 
energies. Then, since 

/ (l + cos(9)sin6lrf6l = 2, and / EfEldEi= Ef{Ei+AEfdEi = —{AEf 
Jo Jo Jo 15 

the total creation probability per unit time is 



'iZ^e'^rt 1 

Wt = 

Numerically 



Ze^ 1 AEro 1 _,o 

— — ~ — and — - — ~ — since Tq = 4 x 10 cm 
he 17 he 10 



Hence the lifetime will be 



T = 15 X 257r X 10^ X 17^ X i X — = — « lO'^^ sec. (134) 

4 c c ^ ' 



Scattering of Two Eleetrons in Bom Approximation. The M0ller Formula. 

We calculate now the transition scattering matrix element M between an initial state A consisting of 2 
electrons with momenta pi,P2 and spin states Ui,'U2 and a final state B consisting of two electrons with 
momenta p[ , p'2 and spin states u[ , u'2 . Thus M gives the probability amplitude for arriving in state B after 
a long time when the system is known to be in state A to begin with. Hence M itself should be an invariant 
relativistically. 

We treat the interaction in Born approximation i.e. consider the particles to go directly from the free- 
particle state A to the free-particle state B by applying the interaction operator once to state A. For 
electrons at reasonably high or relativistic velocities this will be a very good approximation [e^ /hv <C 1). 
Also we treat the electromagnetic interaction classically, just as in the O"'^^ problem, taking the field produced 
by particle 1 according to classical Maxwell equations to act directly on particle 2. This ignores the fact 
that the field consists of quanta. We shall see later, after we have developed the quantum field theories, 
that this introduces no error so long as we are in the Born approximation. 

For the field produced by particle 1 in a transition from the state to we have the matrix 

elements <^(i), say. We use now not the gauge in which V • A = 0, but the covariant gauge in which^^ 



8 A 

" = Ai=iip (135) 



So using covariant notations we have in this gauge 

92 



E 



dxl 



Afi{i) = +4:Tres^^i) (charge is - e) (136) 



= i (wi'7/iWi) exp -j ^ {Piu - p'l^) x„ ^ (137) 
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whence 



^ij. (1) = -^meh^ 



(mi7^ui) exp jEi. ^ {Piv - P'lv) 



where 



(138) 



(139) 



The effect of the field (138) on particle 2 is given by the interaction term in the Dirac equation for 
particle 2 

-eip + ea- A = ie(3^-fi^A^ (140) 
This gives for particle 2 for the transition from state p2, U2 to ^2 ^ transition matrix element 



7/i-4^ (1) -^2 



(141) 



a 3-dimensional integral over space at the time t say. For the total transition matrix element M by first 
order perturbation method 



M 



= dt J rfr ^2^ieE7/^^/^(i)^V'2 = J d^xtp^ V'2 (142) 



where the 4-fold integral is dxi dx2 dx^ dxo, xq = ct. Putting in the values of A^ {i),'4>'2 and ^^2, we get 



M 



■ E (■"27/:*M2) (wh/iWi) 



/ ^2 



d^a; exp ^ E t (PiA - P'lx + P2a - P2a) 



{2TTh)H\pi+P2-v'l-P2) 



(143) 



where (5'*(.x) = Ofc^i <5(a;fe)- 



There is also the exchange process in which the particle pi,ui goes to P2>^2 ^^'^ vice-versa. This gives a 
contribution to M with a minus sign since the wave function ought to be taken antisymmetric between the 
two particles. Hence the final result is 



M = -^!^{2^hf6\p, +P2 - Pi -P^)E 



{U2IHU2) {u[jfj,Ui) (m27m"i) {uiliJ.U2) 



{Plu - P'l^) 



(144) 



This covariant formula is elegant and easy to arrive at. The question now is, how does one go from such 
a formula to a cross-section? 

Generally, suppose in such a 2-particle collision process the transition matrix is 



M = K{2T,h)H\pr+p2 - p\ - p'2) 



(145) 



Then what will be the cross-section in terms of Kl We do this calculation once here so that later we can 
stop when we have found formulae for M of the type (145) which come for example conveniently in this 
form out of radiation theory. 
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Relation of Cross-sections to Transition Amplitudes 

Let w be the transition probability per unit volume and per unit time. This is related to the transition 
probability for a single final state, which is 

Ws = c\K\''{2nh)H\pi +P2- p[ - p'2) (146) 

since in |Mp one of the two (27rfi,)^(5'*(pi + P2 — p'l ^ P2) /c factors represents merely the volume of space- 
time in which the interaction can occur. The number of final states is by (107) 

„2 

- dp'n dp'12 dp'is dp'21 dp'22 dp'23 (147) 



(27rfi)6 \E[\ 

Multiplying (146) by (147) gives the total transition probability 

^ " l^l^ (27r;t)2 :E[^^^^(^^ +P2 -P'l -P2) rfp'n dp'i2 <^P'l3 *21 '^^22 *23 (1^8) 

As 5{ax) = -5{x), we have 
a 

S\pi +P2- p[ - p'2) = SHp^ +P2- p[ - p'2)c6{E, +E2-E[- E'^) 
and the integration over dp2 gives then by the momentum conservation 

2 2 4 

w = 1^1' ^ '^(^^1 +E2-E[- E'2) dp[, dp[2 dp'is (148a) 



Furthermore, 
and thus 



if /(a) = 0, we have fix) = f{a) + f{a){x - a) = f{a){x - a) 

5{x — a) 



Sifix)) = S{f'{a){x - a)} = 



f'ia) 



Applying this to (148a) with /(x) = /(p'ls) = Ei + E2 — E[ — E2 and a = (^'13)0 = the value of p'^g giving 
momentum and energy conservation, we get 

5iE, +E2-E[- E',) = ,^E.^E2-E[-E'2) " ^^-^^^ 

dp'xz 

Hence we finally obtain 

= |>>-|2 "^^C^ ^Pll dp'i2 dp'13 

^ ' ' E[E'2 {2T^hY d{E[ + E'^) 

Choose a Lorentz-system in which pi and p2 are both along the X3-direction and take p'^^ and p'12 as the 
variables over which the transition probability is taken. This is necessary for relativistic invariance. Then 
p'li and p'i2 being fixed and having from the momentum conservation p'^^ = — P23 we get 



d{E[+E[) 



dp[ 



13 



dE[ dE'2 



dp'i3 dp'23 



-^1^2 



Then the cross-section a is defined in this system by 
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where Vi is the normahzation volume for particle 1, and vi its velocity. In fact by (103) 

Vi = -^ V2 = -^ iv^-V2) = -^--p (151) 
Hence the cross-section becomes 

^ = c^lpiE^-p^i^il = |^^p^3 _ ^^p^3| ^E!,p[, - E[p',,\ i2^nf '^"'^ ^^^^^ 

It is worth noting that the factor P1E2 — P2E1 is invariant under Lorentz transformations leaving the xi 
and X2 components unchanged (e.g. boosts parallel to the Xs axis.)^^ To prove this, we have to show that 
P13E2 — P23E1 = P13E2 — P23E1 (where ~ denotes the quantities after the Lorentz transformation) because 
we have chosen a Lorentz system in which the direction of the momentum vector is the X3 axis. Then 

E = E cosh 6 — cp sinh 6 

p = p cosh 6 sinh 6 

c 

Since E"^ = p^c^ + rri^c^, we can write 

E = mc^ cosh (j) pc = mc^ sinh (j), which makes 

E = mc^ cosh((/) — 9) pc = mc^ sinh(^ — 6*) and thus 

-^2^13 - E1P23 = rri^c^ {cosh {4>2 - 0) sinh (t^i - 6) - cosh {(j)^ - 6) sinh ((/)2 - 0)} 
= vn^c? sinh {(fyi — (j)2) 

independently of 6. Hence we see that a is invariant under Lorentz transformations parallel to the X3 axis. 



Results for M0ller Scattering 

One electron initially at rest, the other initially with energy E = 'ymc^; 

1 

7 = 



v/1 - iv/cy 
scattering angle = 9 in the lab system 

= 9* in the ccntcr-of-mass system 

Then the differential cross-section is (Mott and Massey, Theory of Atomic Collisions, 2"^^ ed., p. 368) 
2na{e)de = An{^\ [ ^ ] dx { ——^ -—--+[ ^ ] + (153) 



with 



Without spin you get simply 




2 + (7 - 1) sin^ 



4^ ( — ^ 1 ( I dx 



72 J ■ \(1-.t2)2 1-x2 

Effect of spin is a mcasiirablc increase of scattering over the Mott formula. Effect of exchange is roughly 
3 

the ^ term. Positron-electron scattering is very similar. Only the exchange effect is different because 

1 — x-^ 

of annihilation possibility. 
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Note on the Treatment of Exchange Effects 

The correctly normalized initial and final states in this problem are 

i={^i(l)V2(2)-Vi(2)V2(l)} 

(154) 

i=W(l)V'2(2)-^i(2)V'2(l)} 

where ^"2(1) means the particle 2 in the state 1, and so on. With these states the matrix element M is 
exactly as we have calculated it including the exchange term. 

The number of possible final states is only one half of the number of states of two distinguishable 
particles. But this does not bring a factor i into the differential cross-section, because the density of 
antisymmetrical states, in which one of the two particles has a momentum in a particular range dpi dp2 dps 
is exactly the same as the density of states of 2 distinguishable particles in which the particle labelled 1 lies 
in the given range. Hence the general rule: the differential cross-section does not have a factor i, the total 
cross section does because each final state may only be counted once in integrating over the angles. 



Relativistic Treatment of Several Particles 

The M0ller treatment of the interaction of two electrons succeeds because the field of particle 1 is calculated 
for all time without taking any account of the effect of particle 2 on particle 1. How can one do a better 
calculation taking such reactions into account? Clearly we must construct an equation of motion which 
follows the motions of both particles continuously in time and keeps them in step with each other. So we 
must have a Dirac equation for 2 electrons, taking exact account of their interaction by including in the 
equation the behaviour of the Maxwell field too. 

This kind of 2-particle Dirac equation is no longer relativistically invariant, if we give each particle a 
separate position in space but all the same time. To avoid this Dirac constructed the many-time theory 
in which each electron has its own private time coordinate, and satisfies its private Dirac equation. This 
theory is all right in principle. But it becomes hopelessly complicated when pairs are created and you have 
equations with nc;w time-coordinates suddenly appearing and disappearing. In fact the whole program of 
quantizing the electron theory as a theory of discrete particles each with its private time becomes nonsense 
when you are dealing with an infinite "sea" or an indefinite number of particles. So we have come to the 
end of what we can do with the relativistic quantum theory of particles. 

Where did the theory go wrong? Obviously a lot of the troubles of arose from the fact that a particle 
was always described by an operator r representing its position at the time t, t being a number and not an 
operator. This made the interpretation of the formalism essentially non-relativistic even when the equations 
were formally invariant. In equations like the K. G. and the Dirac equation, the space and time coordinates 
appear symmetrically. Thus we are led to the following new view-point. 

Relativistic quantum theory is the study of quantities ip which are functions of four coordinates xi, X2, 
xs, xo, all the coordinates being c- numbers and only the expressions containing tjj being operators describing 
the dynamical system. 

The dynamical system is specified by the quantity ^ existing at all points of space-time, and so consists 
of a system of fields. Relativistic quantum theory is necessarily a field theory. 

The process of reinterpreting a one-particle wave-function like the Dirac i/' as a quantized field operator 
is called Second Quantization. 



Field Theory 



Before we can begin on the program of constructing our quantum theory of fields, we must make some 
remarks about Classical Field Theory. 

Classical Relativistic Field Theory 

We take a field with components (vector, spinor etc.) labeled by a suffix a. Let 




(155) 



The theory is fully described by an invariant function of position called the Lagrangian Density, 

^-^(r(,x),</);:(.T)), (156) 

a function of cp" and its first derivatives at the point x. The behaviour of the field is fixed by the Action 
Principle. If O is any finite or infinite region of space-time, then 

7(0) = - [ ^d^x (157) 
c Jn 

is stationary for the physically possible fields (j)". Thus the variation y;" — » 0" + d(j)" produces no change 
in / to first order in 5^", if 5(j)°' is an arbitrary variation equal to zero on the boundary of fi. 

It is always assumed that ^ is at most quadratic in the and is in various other respects a well-behaved 
function. 

Let E be the boundary of fl, and da an element of 3-dimensional volume on S, the outward unit 
vector normal to da, and 

da^ = n^da, ^Z*^^"""*^ M= 1)2,3,4 xq = ct 

M (158) 
dafi = {dx2 dxs dxo, dxi dxs dxo, dxi dx2 dxo, —i dxi dx^ dx^) 

Then 




(159) 



So the principle of action gives the field equations 
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defining the motion of tlie fields. 
The quantity 



c 



is the momentum conjugate to </>", defined at x and with respect to the surface S. 

A more general type of variation is made by varying not only the (p" but also the boundary of O, each 
point being moved to the position (a;^ + Sxf^) where is cither constant or may vary over the surface. 
Writing jv</>" for the new and o^" for the old one, we have 

= Ncl^'^ix + 5x) - o(t>"{x) 
o(l)'^{x + 6x) = o^^ix) o^'^ix) 

A^ix) = N^ix) - o^ix) (162) 

Therefore under the joint variation 

c6I{n)= f ^{Nr{x),Nr^{x))d^x- f ^{or{x),or^{x))d^x 

= 1/ -/ \^{Nr{x),N4>''^{x))d^X^ I {^{Nr{x),Nr^{x))-^{or{x),or^x))]d^X 



/ Vn^fe^^Sf (jv?!'"(a;),jvC(a;)) da + c V7ra(a;)A^"(a;) da 



the latter being true by (159) if we assume (160). 
Now since by (162) 

6r{x) = N^ix) + ^ Sx^ N^^ix) - or{x) = A<^" + ^ Sx^ N<P"^{X) 

H n 

hence we get finally 

(5/(0) ^ j^Y. 1'^"'^'^" + "m-^ - '5^^} d'^ (163) 

with all the new quantities on the RHS. 

In the case which is physically of importance, the actual motion is fixed uniquely by specifying the 
values of the 0" everywhere on two space-time surfaces fT2 and ai which are the past and future boundaries 
of the volume Q. A space-like surface is one on which every two points are outside each other's light-cones, 
so that the fields can be fixed independently at every point. 

Special case of non-relativistic theory, both cti and a2 are just space at the time t\ and t2, and Ja;^ is ic 
times a displacement of the time by 5t\ and 5t2- Then we may write = (0,0, 0,i), ttc = d^/dcf)'^, and 
then for the Hamiltonian 



jdr {^.. 



H= I dr [ >^7r«r-^) (164) 
and thus 

SI{n) = f drY, {{■^M'") (ii) - (TTa-^r ) {ti)} - {H{ti) Sti - H{t2) St2} (165) 

The essential feature of this classical theory is that the Action Principle is stated only for variations 
vanishing on the boundary of il. From this one can deduce as in (163) and (165) the effect on of 
variations not vanishing on the boundary. This is possible because each state of motion is defined by fixing 
as many field quantities as can be fixed independently, (e.g. all the fields on two space-like surfaces or all 
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the fields and their time-derivatives on one surface) and then the whole past and future of the motion is 
determined by the field equations. 

Field equations can be written in the Hamiltonian form 



Examples: 

1. Klein- Gordon Field, real 



dH . dH 



■^- = -I^ME(S^) (166) 



2. Klein- Gordon Field, complex 



where we consider xj) and tj}* independent one-component fields. 
3. Maxwell Field, four component A^, Fermi form. 



4. Dirac Field 

= -ncii (^^^1- + j V = ^ (169) 

5. Dirac Field interacting with Maxwell Field 

^Q=^D+^M-Y. *s^A?^7aV' (170) 

A 

here Q stands for quantum electrodynamics. 



Problem 5. Work out these examples: find the field equations, the momentum conjugate to each component 
of the field, and the Hamiltonian function, (the momenta and Hamiltonian defined for the case of a flat 
space (7 only). Verify that the Hamiltonian gives a correct canonical representation of the field equations 
as Hamiltonian equations of motion. 



Quantum Relativistic Field Theory 

The classical relativistic field theories were usually quantized by using the Hamiltonian form of the field 

equations and bringing in the commutation relations between coordinates and momenta taken from non- 
relativistic quantum mechanics. For this approach see Wentzel's book. It is a very bad method, it is 
complicated; and it is not at all obvious or even easy to prove that the theory so made is relativistic, 
because the whole Hamiltonian approach is non-covariant. 

Just recently we learnt a much better way of doing it, which I shall now expound in these lectures. It 
is due to Feynman and Schwinger.^^ 
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References: R. R Feynman, Rev. Mod. Phys. 20 (1948) 367 

Phys. Rev. 80 (1950) 440 

J. Schwinger, Phys. Rev. 82 (1951) 914 



It is relativistic all the way, and it is much simpler than the old methods. It is based directly on the Action 

Principle form of the classical theory which 1 have just given you, not the Hamiltonian form. 

In the quantum theory the 0" are operators defined at each point of space-time as before. They satisfy 
the same field equations as before, and this is ensured if we assume that the Action Principle 

si{n) = 

iin) = - I ^{r,r^) d^x (171) 

c Jn 

holds for all variations of the operators vanishing on the boundaries of O. 

In the quantum theory, because of complementarity relations, it is not possible to give numerical values 
to all field operators throughout a physical motion. In fact the state of motion is specified by giving 
numerical values to the on one space-like surface. The future of the state of motion cannot then be 
determined from the field equations, which are in general second-order differential equations. Therefore the 
action principle (171) which was enough for the classical theory is no longer enough. We must make some 
additional statement about the behaviour of 51 for variations Sep" which are not zero on the boundaries of 
Q. 

A state of motion is specified by specifying a space-time surface a and a set of numerical values 4>' " for 
the eigenvalue which the operators (j)" on a have in this state. The state is denoted by the Dirac ket vector 
|(/)'", cr). This is a special kind of state in which the (j)" on <t have eigenvalues: the general state is a linear 
combination of \(p'°',a) with various values of The physically observable quantities are expressions 

such as the matrix element 

(<^;«,c7i|</)'^(x)|0^",a2) (172) 

of the field operator (f>^{x) between the two states specified by <j)'i°' on ai and by on a2. In particular, 
the transition probability amplitude between the two states is 

(</>r,ai|<^^",a2) (173) 

The squared modulus of this gives the probability of finding the values (f)^" for the fields on cri, in the 
motion which is defined by the fields being given the definite values (^2" <^2- 

The Feynman Method of Quantization 

The Feynman method of quantizing the theory consists in writing down an explicit formula for the transition 
amplitude (173). Namely 

ai 1 (T2)=nJ2 exp I ^Inin)^ (174) 

Here H represents a History of the fields between c72 and di, i.e. any set of classical functions (j)"{x) which 
are defined in the region O between o"2 and ai and which take the values on cti and (1)2°' on CT2. Ih{^) 
is the value of calculated with these particular functions. The sum is taken over all possible 
histories, a continuously infinite sum whose exact mathematical definition is not easy to formulate. is 
a normalization factor independent of the particular states considered, chosen so as to make the sum of 
the squares of the amplitudes from a given state to all other states equal to 1. This formula is derived by 
Feynman from very general considerations, applying a Huyghens principle to the solution of wave-mechanics 
just as it is done in wave-optics. By this one formula the whole theory is quantized and the answer to any 
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physical problem in principle given. The method applies not only to field theory but to ordinary NR 
quantum theory too. We do not try to derive or justify the Feynman formula here. We just show that it 
gives the same results as the usual QM. For a discussion of the difficulties in defining the sum "Y^^j, and a 
method of doing it in simple cases, see C. Morette, Phys. Rev. 81 (1951) 848. 

From formula (174) we derive at once the most general Correspondence Principle giving us back the 
classical theory in the limit as ^ 0. For suppose ft then the exponential factor in (174) becomes an 
extremely rapidly oscillating function of H for all histories H except that one for which /(il) is stationary. 
Therefore in the limit the sum reduces to the contribution from the classical motion leading from (l)'2' 
on (T2 to (f)'^°' on (Ji, all other contributions interfering destructively. The classical motion is defined by 
the condition that^'' its 51{il) = for all small variations of the between and a\. This passage to 
the classical theory is precisely analogous to the passage from wave-optics to geometrical optics when the 
wave-length of light is allowed to tend to zero. The WKB approximation is gotten by taking h small but 
not quite zero. 

To establish a connection between the Feynman method and the ordinary method of quantization, 

Feynman has to define what he means by an operator in his formulation. This he does as follows: Let x be 
any space-time point inside Let 0{x) be any field operator defined at x, for example (p^{x) or 4^^{x). 
Then 0{x) is given a meaning by defining its matrix element between the states \(j)'2',(^2) and , <Ji) , 
where (J2 and ai are any two surfaces to the past and future of x. This matrix element is 

{4>[",a^\0{x)\4>'^'',a2) = NY,OH{x)e^v{UH(Sl)\ (175) 

The number Or is just the value which the expression O takes when the arc given the values which they 
have in the history H. It is easily verified that the definitions (174) and (175) are physically reasonable 
and give the right formal properties of transition amplitudes and operator matrix elements. 

The Feynman method has one fatal drawback: we cannot use it until we have some way of calciilating 
or at least using the sums over histories, and so far nobody has suggested a practical way of doing this. 
But Schwinger has shown how to derive from the Feynman method an Action Principle formulation of the 
theory which avoids this difficulty. 

The Schwinger Action Principle 

Let the sets of eigenvalues 0^" and (/)2" in (174) be held fixed. Let the numbers 4>%{x) be varied so that 
(I)'h{x) is replaced by (j)'fj{x) + 5(()°'{x) where 5(j)°'{x) is an arbitrary infinitesimal c- number quantity. Let the 
surfaces cti and (T2 be varied so that the point x^ moves to a;^ + 5x^. And let the function ^ also be varied 
so that it is replaced by ^ + where (5 is any expression involving the 0" and t/)^ . Under this triple 
variation (174) gives 

5(.^;«,<7i|<^^«,a2) =A^5^||5JH(0)exp(|^7H(f^))| (176) 
Using (175) this may be written 

S I <A^", a2) = ^ {4>[", I SI{Q) I <^^°, (72) . (177) 

Here 61 (ft) is the operator obtained by making the three variations on the operator Formally 61 (Q) 

is the same as the variation obtained in the classical theory, 

«(")^X{-.|:(|f-il|)^- 
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Only now everything on the RHS of (178) is an operator. 

Now what is the meaning of this triple variation as applied to the left side of (174)? Since the (j^fji^) 
are only variables of summation, the change from (f>ff(x) to </>^(a;) + Sip^ix) only affects the left side in 
changing the boundary values which (l)fi{x) must take on ai and (72. Thus instead of 4>h{x) = 4>f"{x) on 
CTi we now have the new summation variable 

^% (x) + <5<^" = 0f' (x) + Sc^" on (71 

Therefore the change in <pfj is equivalent simply to changing 

(f)^" to (Tif + on (71 

to <t>2+H°' on (72. 

The change in ^ and in the position of a produces a change in the left side of (174), by virtue of the 
change in the operators (f)" on cri and (72 resulting from the variations 6^ and dx^ in consequence of the 
field equations. 

Hence the net result of the triple variation on the left side of (174) is to give the change in the matrix 
element (0i", cr\ \ (f)^ , (72) if the (/)"" and (t)^' are left fixed, the operators 0"(a;) on a\ and (72 being modified 
in consequence of the variations 5^ and 5Xfj, according to the field equations, and in addition (ff {x) on a\ 
and (72 being changed to (/)"(.t) — (5(^"(.t). 

Schwinger takes equation (177) as the fundamental principle for setting up the quantum theory. Thus 
he gets rid of the unpleasant Y^u- Out of this action principle come very simply all the main features of a 
quantum field theory, A - G below. 



A. The Field Equations 

If we take the special case of a variation 5(j)°' which vanishes on the boundary of f2 and = 5x^ = 0, 
then ((^"', (Ti I (/)2', (T2) depends only on the operators (ji" on <Ji and (J2 and is unaff'ected by the variation. 
Therefore for all such variations 

5I{^) = (171) 

^dx.dr^-^ ^^^^^ 

That is to say, the classical action principle and the classical field equations are valid for the quantum field 

operators. 

We see that (177) is exactly the kind of generalization we want of the old variation principle (171). It 
includes the information, necessary for a quantum theory, concerning the eff'ect on 7(0) of variations not 
vanishing at the boundary of O. 



B. The Schrddinger Equation for the State-function 

Specialize cti and C72 to be the whole space at the times t\ and t2- Then 

(^r , (71 I (/)^", (72) = ((^i", tl I ,/)^", i2) = * (^r , tl) 

is a Schrodinger wave-function giving the probability amplitude for finding the system in the state 4>'-^ at 
the time ti, given the initial conditions (j)'^ at ^2- The development of ^ {4>'i'^ ti) with time ti is thus a 
description of the development of the state of the system with time in the Schrodinger representation. 

Take in (177) a variation in which Scj)'^ = 8^ = 0, the surface a\ being just moved through the 
displacement 5t in the time direction. Then using (165) and (164) 

5^ (</)'!«, ti) = -'- {ct>[^, il I if (il) I <^^« , t2) 6tl 
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or 

ihj^ , ti I c/)^" , t2) = (<^'i" M\H{ti)\ (/.'a" M) (180) 

This is the ordinary Schrodingcr equation in Dirac's notation. It shows that the Schwinger action principle 
contains cnougli information for predicting the future behaviour of a system given initially in a known 
quantum state. 

C. Operator Form of the Schwinger Principle 

Feynman defined operators by giving the formula (175) for their matrix elements between statcis spcicified 
on two different surfaces. The initial state had to be specified in the past, the final state in the future, the 
operator referring to some particular time which is taken as present. 

The usual and generally more useful way of defining operators is to specify their matrix elements between 
states defined on the same surface. Thus we are interested in a matrix element 

cr|O|0"", a) (181) 

where 0'" and 0"" are given sets of eigenvalues and a is a surface which may be past, present or future in 
relation to the field-points to which O refers. 

Suppose that a reference surface Uq is chosen in the remote past. Let the 0", a and ^ be varied in such 
a way that everything on remains fixed. For such a variation, (178) gives if we assume that (179) holds 

5I{0) = ^ j S^d'^x + J ^TTad(l)"' + (^^ n^^ - ^'^■jtc,^ Sx^,^ da (182) 

where fl is the region bounded by CTo and a. Let us now first calculate the variation of (181) arising from 
the change in the meaning of the states |(/)'",(t) and |(/)"",(t). The operator O itself is at this point fixed 

and not affected by the variations in 0", a and Then 

((/.'", a I c? I r", = E E (<^'"' ^1 '^o", ^o) i^o", I o I . (^o) (<^r . <^o i c/-"", a) (i83) 

K 'Pa 

therefore, denoting 

(^'",CT|C»|0"",a) = {a'\0\a") etc., we have 

S{a'\0\ a") = J2Y.(^(^'\ <)) (^o I O I K I + E E I <) ^<\<^\ <) K I 
I II I II 

because \(j>'^) and |(?io are not changed by the variation, and neither is O. Therefore, using (177) we have 

6{a'\0\ a") = E E ^ I ^ I + E E ^ I ^ I '^") 

I II I II 

where the subscript a — Gq refers to the surface integrals in (178). Since 51^-^^ = —Sla^-a, we get finally 

6{cp'",a\0\<f>"",a) = ^{cl>'",a\[6I{Q),0]\<f>"",a) (184) 

where [-P, -R] = PR — RP- This applies for the case when O is fixed and the states vary. 

Now we want to calculate the variation of ((/)'", cr 1 1 0"", cr) for the case when the states arc fixed, and 
O = O ((/)"(c7)) changes. This, however, will be the same as for the previous case, except with the opposite 
sign, because the variation of the matrix element^* 

{(j)"^,a\0\(j)"",a) (185) 

if both the states and O change simultaneously is zero. Therefore, if we use a representation in which matrix 
elements of O are defined between states not subjc^ct to variation we get^® 

ih50{(T) = [ SI{n),0{a) ] (186) 

This is the Schwinger action principle in operator form. It is related to (177) exactly as the Heisenberg 
representation is to the Schrodinger representation in elementary quantum mechanics. 
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D. The Canonical Commutation Laws 

Taking for a the space at time t, for 0{a) the operator (l)"{r,t) at the space-point r, and Sx/j, = (5Jf = 
we have by (182) and (186) for an arbitrary variation Scj)" 

-ihSr{r,t)=Y^ I [Mr',t)5cj,^{T',t), r(r,t)]rfV' (187) 

because da = —n^da^ = —i{~idx'idx'2dx';^) = —d^r' by (158); the unit vector in the increasing time 
direction is i, and this is the outward direction since we choose Uq in the past. Hence for every r, r' 

[ 0" (r, t),^p{r',t)]^ih 6-' {r-r') (188) 

Also since the 0"(r) on a are assumed independent variables, 

[r{r,t),(j>l'{r',t)]=Q (189) 

So this method gives automatically the correct canonical commutation laws for the fields. There is no need 
to prove that the commutation rules are consistent with the field equations, as was necessary in the older 
methods. 



E. The Heisenberg Equation of Motion for the Operators 

Suppose that (T is a flat surface at time t, and that a variation is made by moving the surface through the 
small time 6t as in B above. But now let 0{t) = 0{a) be an operator built up out of the field-operators 
(jf on a. Then by (165) and (186) the change in 0{t) produced by the variation is given by 

ih50{t) = [-H{t) St, Oit)] 

That is to say, 0{t) satisfies the Heisenberg equation of motion 

in^ = [0{t),H{t)] (190) 
where H{t) is the total Hamiltonian operator. 



F. General Covariant Commutation Laws 

From (186) we derive at once the general covariant form of the commutation laws discovered by Peierls in 
1950 [13]. This covariant form is not easy to reach in the Hamiltonian formalism. 

Let two field points z and y be given, and two operators TZ{z) and Q{y) depending on the field quantities 
(j)" at z and y. Let a reference surface ao be fixed, past of both z and y. Suppose the quantity, 

6n{^) = eS^ix-z) Tl{z) (191) 

is added to the Lagrangian density ^{x), where e is an infinitesimal c- number This will make at most 
a certain infinitesimal change edTi<l>"{x) in the solutions (j)"{x) of the field equations. Supposing the new 
(j)" {x) to be identical with the old one on CTo then Sntp" {x) is different from zero only in the future light-cone 
of z. 

Similarly adding 

6Q{^) = eS\x-y)Q{y) (192) 

to ^(x) produces at most a change e6Q4>°'{x) in the (j)"{x). Lot t5'nQ{y) be the change in Q{y) produced 
by the addition (191), while e 5QTi{z) be the change in 'R-{z) produced by (192). Suppose y lies on a surface 
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a lying in the future of z. Then we take Q{y) for 0{a) in (186), and 5^ given by (191)- The 6I(yt) given 
by (182) reduces then simply to 

51{n) = -en{z) 
c 

It is assumed that there is no intrinsic change 5(j)°' of (j)"' or Ja;^ of a apart from the change whose effect is 
already included in the 5^ term. Thus (186) gives 

[n{z), Q{y)] = -ihcSaUiz) {zo>yo) (193) 

When y and z are separated by a space-like interval, the commutator is zero, because the disturbance TZ{z) 
propagates with a velocity at most c and therefore can affect things only in the future lightcone of z; this 
means 5nQ{y) = in this case. 

Peierls' formula, valid for any pair of field operators, is 

[^(^), Q{v)\ - itic{5nQ{y) - 5qR{z)} (194) 
This is a useful formula for calculating commutators in a covariant way. 

G. Anticommuting Fields 

There is one type of field theory which can be constructed easily by Schwinger's action principle, but which 
does not come out of Feynman's picture. Suppose a classical field theory in which a group of field operators 
tp" always occurs in the Lagrangian in bilinear combinations like ip^tl)" with the group of field operators 
ip. Examples, the Dirac and the quantum electrodynamics jSfg. 

Then instead of taking every (f)" on a given surface a to commute as in (189), we may take every pair 
of ip" to anticommute, thus^" 

{V"(r,t), v^(r',f)} = {p,n} = rTi + np (195) 

The bilinear combination will still commute, like the (f>"^s did before. The -0" commute as before with 
any field quantities on a other than the ijj and ip. Schwinger then assumes (177) to hold precisely as before, 
except that in calculating dI{Q) according to (178), the variation Sip" anticommutes with all operators tp" 
and tp^ . In these theories it turns out that the momentum tTc conjugate to tp" is just a linear combination 
of tp, because the Lagrangian is only linear in the derivatives of tp. With the anticommuting fields the field 
equations (179) are deduced as before, also the Schrodinger equation (180), the commutation rules being 
given by (186) and (187). But now in order to make (187) valid, since 5ip^ anticommutes with the tp and 
TT operators, the canonical commutation law must be written 

{^«(r,t), n0{r',t)} = -ihSc^^S^r - r') (196) 

The general commutation rule (194) is still valid provided that Q and TZ are also expressions bilinear in the 

tp and tp. 

The interpretation of the operators, and the justification for the Schwinger principle, in the case of 
anticommuting fields, is not clear. But it is clear that the Schwinger principle in this case gives a consistent 
and simple formulation of a relativistic quantum field theory. And we may as well take advantage of the 
method, even if we do not quite understand its conceptual basis. The resulting theory is mathematically 
unambiguous, and gives results in agreement with experiment; that should be good enough. 



Examples of Quantized Field Theories 



I. The Maxwell Field 
Lagrangian 

Field equations 

Commutation rules for the ^a: To find these we uses the Peierls method. Take two points y and z 

with Z{) > Hi). Let Q{y) = A\{y), TZ{z) = A^{z). Note: in this section x, y, z, k etc. are meant to 
have components 1, 2, 3 and 0, while in x^, y^, 2^, fc^, etc. we mean = 1,2,3 and 4. When Sq{^) = 
e5^{x — y)A\{y) is added to ^m, the field equation for Afj, becomes 

n'A^ + 5,^^€5\x-y) = (198) 

This equation is satisfied by A^ + JqA^(2) (by definition), and hence also by 5qAh{z) because of (197). 
Therefore 5qA^{z) is defined by the conditions 

U\5QA^{z))^-5^^S\z-y) 

8qA,,{z) = for 20 < ?yo. (199) 

That is to say, (5q^^(z) is a c- number and is the retarded potential created by a point source acting 
instantaneously at the space-time point y. 

5qA^_,{z) = Sx^^Dniz - y) 
a^Dn{z-y) = -6\z-y) (200) 



If X is any 4- vector, using 



we get 



Here 



6{x'^ ~ = ^ {'^(^ - a) + S{x + a)} , a > 



Dr{x) = ^e{x) S{x') = ^ S{xo - \r\) (201) 



\r\ = y xl + xl + xl; Xo = ct; x^ = r"^ — x^ Q{x) 
In the same way 



+1 for a; > 
for a; < 



SnAxiy) = Sx^^D^iz - y) = 5x^DR{y - z) (202) 
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where Da is the advanced potential of the same source, 

Hence we have the the commutation rule (194) 

[A^{z), Ax{y)] ^ihc6x^ [D a{z - y) - D r{z - y)] 

— ihc 6x^D{z — y) (definition of D) 

This invariant /^-function satisfies by (200) 

n''D{x) = -^S^ix) - i-S^x)) = 

as it must. Also 

1 



(203) 
(204) 



D{x) 



Momentum Representations 
We have 



47r|r 
1 

^2^ 



- [S{xo + \r\)-S{xo-\r\)] 
e{x)S{x^) e(a;) = sign(a;o) 



(205) 



exp{ik ■ x) d'^k 



where the integral is fourfold, over dki dk2 dk^ dk^. Therefore 

If 1 

^Ri^) = exp(ifc • x)— d^k 



(206) 



(207) 



where fc^ = |fcp — fcg. The integration with respect to fci fc2 fcs is an ordinary real integral. That with 
respect to ko is a contour integral going along the real axis and above the two poles at ko — ±\k\. 





o — - 







For detailed calculations, see the Appendix below. This gives the correct behaviour of Dfi being zero for 
xq < 0. Similarly 

DAix)^j^Jexp{ik-x)^d''k (208) 
with a contour going below both the poles. Therefore 



If 1 
D{x) = j^-:^ J exp{ik ■ x)-^ d^k 



(209) 



with a contour s as shown. 
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(27r)3 



exp{ik ■ x) 6{P) e(fc) d^k 



Evaluating the residues we have 

D{x) = - 

this being now an ordinary real integral. 
Appendix 



Let us verify, e.g. (207). For xo < 0, we have to take the top path shown, 
otherwise the integrand blows up; this gives obviously 0. 
For xq > 0, we have to take the bottom path; then 



(210) 



Dnix) = 



1 



(27r)4 



■ d^kdko 



where k and x are 3 dimensional vectors. 
Now because of the clockwise direction 




-iknxo 



dko = — 



-ikoxo 



■ dko 



{ko - \k\){ko + \k\) 

27ri (Residue at fco = |fc| + Residue at ko = — \k\) — 27ri 



g-i[fc[xo g+i\k\xo 



2\k\ 



2|fe| 



Hence 



n„('™^ _ ^ f _}_ ik x ( i\k\xo _ -i\k\xo\ 

^" (2^)3 J 2\k\ I / 



(27r) 



* 27r JJ -i-e^l'=ll^l™^»|e*l'=l^«-e-*l'=l^°}|fc|2d|fc| sinOde 



1 1 
(27r)2 y„ 2\k\i\k\\x 

1 1 



|gi|fc|a;o _ g-j|fc|a;o| |j^|2 / g'i|fc||a;|Q 



d\k\ 



47r2 2\x\ 

1 1 

47r|a;| 27r 
1 



i|fc|(a:o + |x|) _ ^i\k\{xo-\x\) _ ^~i\k\{xo-\x\) _|_ ^-i\k\(xo + \x\) 



J^gi\k\(xa 



+ 1^1) _ g*l'=l(^o-|a; 



)}d|fe| 



AttIxI 



d{xo — \x\) for xq > 0. 



Fourier Analysis of Operators 

Let us analyze the potential A^ into Fourier components 

^ ^ J 1^1^^^^ Wkf^ exp{ik ■ x) + tikfj. exp(— ifc • x)} 



(211) 



where the factor |fe|~-'^/2 appears only as a matter of convenience; the actual Fourier coefficients are then 
\k\~^^'^ akfi and \k\~^^'^ dkii- The integration is over all 4-vectors {k) with ko = +|fc|- -B is a normalization 
factor to be determined later. The a^^ and dki_i are operators independent of x. 
Since Ai, A2, A3 and Ao are Hermitian, 



dk^i = Ok^ = Hermitian conjugate of Ofc^, ^ — 1, 2, 3, and therefore 
cikA — —CLki — —Hermitian conjugate of 0^4 



(212) 
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Computing the commutator [^^(2), Ax{y)] from (211) and comparing the result with (203) in the 
momentum representation (210), we have first, since the result is a function of {z — y) only^^ 

[akfi,, ftfe'A ] = 

[flfe^, afe'A]=0 (213) 
[ ,ak'x]=5^{k- k') S^x 

And the two results for the commutator agree then precisely if we take 



Emission and Absorption Operators 

The operators A^{x) obey the Heisenberg equations of motion for operators (190) 

ih^ = [A^,H] (190a) 

Therefore the operator a^^ exp(ifc • x) has matrix elements between an initial state of energy Ei and a final 
state of energy E2, only if 

ih{-icko) = Ei-E2 = hc\k\ (215) 

because by (211) and (190a) wc have 

ipi ih{-icko)akn ip2 = ^1 hc\k\ a^^ V2 = tpi [d/nfc' H]'ip2 = (Ei - E2)ipi a^k i>2 (216) 

Now hc\k\ is a constant energy, characteristic of the frequency uj = ck characteristic of the particular Fourier 
components of the field. The operator ak^ can only operate so as to reduce the energy of a system by a 
lump of energy of this size. In the same way, dk^ will only operate when 

Ei-E2 = -hc\k\ 

to increase the energy by the same amount. 

This is the fundamental property of the quantized field operators, that they change the energy of a 
system not continuously but in jumps. This shows that our formalism includes correctly the experimentally 
known quantum behaviour of radiation. 

We call Qk^ the absorption operator for the field oscillator with propagation vector k and polarization 
direction fx. Likewise Ofc^ the emission operator. 

We have thus 4 directions of polarization for a photon of given momentum. There are not all observed 
in electromagnetic radiation. Free radiation can only consist of transverse waves, and has only 2 possible 
polarizations. This is because the physically allowable states * are restricted by a supplementary condition 

E^* = (217) 

where is the positive frequency part of A^, i.e. the part containing the absorption operators. In the 
classical theory we have 

^ ox^, 

the condition imposed in order to simplify the Maxwell equations to the simple form D^A^ = 0. In the 
quantum theory it was usual to take 
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but this means that photons of a certain kind cannot be emitted in a physical state which is physically 
difficult to understand and brings mathematical inconsistencies into the theory. So we assume only (216) 
which only says these photons are not present and cannot be absorbed from a physical state, which makes 
good sense. Also in the classical limit Y^^dA^/dxn is a real quantity, and so = follows 

correctly from (216) alone. 

The method of using (216) as supplementary condition is due to Gupta and Bleuler^^; 

S. N. Gupta, Proc. Roy. Soc. A 63 (1950) 681. 
K. Bleuler, Helv. Phys. Acta 23 (1950) 567. 

The older treatment is unnecessary and difficult, so we will not bother about it. 
By (211), (216) is equivalent to assuming 

[k^ak^) * = (216a) 

for each momentum vector fc of a photon. 

As a result of this work of Gupta and Bleuler, the supplementary conditions do not come into the prac- 
tical use of the theory at all. We use the theory and get correct results, forgetting about the supplementary 
conditions. 

Gauge-Invariance of the Theory 

The theory is gauge-invariant. That is to say, adding a gradient^^ A^ = dA/dXfj, to the potentials does not 
change the fields by anything which is physically observable. Therefore all states which differ only by such 
an addition to the potentials are physically identical. 
If ^ is any state then 

is a state obtained from ^ by emitting a pseudo-photon with potentials proportional to dA/dx^j,. Hence ^' 
should be indistinguishable from 

Now if ^'2 is any state whatever satisfying the supplementary condition (216a) the matrix element 

(*'*, *i) = (** (l + A k; al^) , *2) 

= {^*, (l + A k, 4 J ^2) = i^*, *2) (216b) 

Hence the matrix elements of and * to any physical state ^"2 whatever are equal, and so the results 
of the theory are all independent of whether the state ^ is represented by the vector ^ or by This is 
enough to show that the theory is properly gauge-invariant, in spite of the fact that states are specified by 
the potentials which are not themselves gauge-invariant. 

The Vacuum State 

The vacuum state is by definition the state of lowest energy, so that all absorption operators operating on 
it give zero: 

ak^-^o = (217a) 

and therefore by (212) 

(afc^*o)* = (*:4^) = ±(*:afc^) = (217b) 
Given any operator Q, we are interested in the "vacuum expectation value" of Q defined as 



{Q)o = iK, 2*0) 



(218) 
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Then we have at once 

(afcpafc'A)„ = by (217a) (219a) 

{ak^ak'x), = by (217b) (219b) 

{ak^.ak'x), = by (217a,b) (219c) 

And by the commutation laws (213) and (219c) we have 

(akt^ak'x)^ = ( [flfep, ak'x])^ = 6^{k - k') Sf,x (220) 

The vacuum expectation value {A^{z) Ax{y)) ^ is thus just the part of the commutator [A^{z), Ax{y)] 
which contains positive frequencies exp{iA; • {z — y)} with ko > 0, as one can see using (211), (219) and 
(220). Thus^s 

{A^{z) Ax{y))^ = ihcS^x D+{z - y) (221) 

D+{x) = -—^ [ exp{ik-x)S{k^)e{k)d^k (222) 
K^'^) Jko>o 

We write 

D{x) = D+{x) + D-{x) (223) 
D+ = \[d- iD^^^^ £)- = i (d + (224) 
The even function D^^^ is then defined by 

{A^{z) Ax{y) + Ax{y) A^{z))^ = hc6^x D^'\z - y) (225) 

D^i' (x) = I exp(ifc • x) 5{k^) d^k (226) 

(2iTy J 

It is then not hard to prove (see the Appendix below) that 

(227) 

The functions D and D^^^ are the two independent solutions of D^Z) = 0, one odd and the other even. 
Then we define the function 

D{x) = -\e{x) D{x) = i (Dnix) + Da{x)) = ^ (^(a;^) 

= (^/exp(*/c.x)^d4fc (228) 

the last being a real principal value integral: This is the even solution of the point-source equation 

U'Dix) = -5^{x) (229) 

Appendix 

^'''^^^= (2^)^/ «^'"^(^')'^'^ 

^T^j J -I J-oo Jo 2|A;| 



{2n) 

2t:Y Jq i\k\\x\ L J 2\k\ L J 

11/"°° 1 1 

= sin(|A;||a:|) cos(|A;|a;o) d\k\ = ^^l*:! {sin((|a;| + a;o)|fe|) + sin((|a:| - a;o)|fe|)} 



(27 
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Taking the integral in the Abehan sense 

a 1 



hence in our case^^ 



Urn / 



lim / e sin ax dx = lim 



e^o + a? a 
1 1 r 1 11 1 



27r^ 2|a;| l|a;|+a;o |a;| — xo j 2-k'^x'^ 



The Gupta-Bleuler Method 

There is one difficulty in the preceding theory. We assume according to (220) 

(afep4'A>o = ±'^^(fc-fc')^MA (220a) 

Here the plus sign holds for /U = 1,2,3; the minus sign for /x = 4. Now if the operators a^^, a^^ are 
represented in the usual way by matrices, as it is done in the elementary theory of the harmonic oscillator 
(see Wcntzcl p. 33. Eq. (6.16)), the vacuum expectation values of a product (a^^ '^kfj ^i^^ always be positive, 
i.e. the plus sign should hold in (220a) for /z = 4 also. In fact {uku a^^) will have a positive expectation 
value in any state whatever, if the photon oscillators are treated as ordinary elementary oscillators. 

Therefore we must distinguish between the scalar product (^'*, ^'2) as we have defined it by our covariant 
theory, and the scalar product (\1/|, '^2) e which one would calculate using the explicit matrix representation 
of the operators. The product (^ff, ^'2)^; has no physical significance, because the matrix representations of 
the afc4 refer to states with photons polarized purely in the time dimension, which cannot occur physically. 
However, it is convenient also to be able to use the matrix representations in practice. 

To use the matrix representations, we have only to define an operator ry by the condition 

77$ = (-1)$ (220b) 

where $ is any state in which there is a definite number A'' of photons polarized in the 4-direction. Then 
the physical scalar product is given in terms of the explicit matrix representations by 

{^l, 5-2) = r]^2)E (220c) 

The definition (220c), introduced by Gupta, makes the matrix representations consistent with all the re- 
quirements of the covariant theory, in particular also it gives (220) correctly. The physical scalar product 
is thus an indefinite metric regarded from the point of view of the matrix representations. However, we 
have seen in (216b) that for any physical states the scalar product (^P*, ^'2) is equal to {'^\rp^ ^it) where 
^iT is a state involving transverse photons only, and hence is positive. Thus for physical states the metric 
is definite and this is all that we require of it. 



Example: Spontaneous Emission of Radiation 

This is a purely quantum-mechanical effect. A classical treatment, considering the reaction of the atom to 
a classical applied Maxwell field, gives a correct account of the absorption of radiation and of stimulated 
emission, but fails to give the spontaneous emission. 

Let an atom have 2 states, the ground state 1 and an excited state 2 with energy ficg. For the transition 
2 — > 1 let the charge-current density of the atom have the unintegrated matrix elements 

ipiA{x) = 3^_iA{r■, t) at the point x = (r, t) 

The interaction with the Maxwell field has matrix element^'^ 



^ = --J E^-?MA(r,i) {A^{r,t))^^^d^r 



(230) 
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for making a transition with emission of a photon. The total emission probabihty per unit time is obtained 
using time dependent perturbation theory:^^ 

w=^> ■\aATr = ^ > ' ^ I i-1 I \ i,..A(rA){AJrA))_.,^d''r\dt 



photon ^ ^ 

states 

= I [ T.fxA{x')Ui^) {Alix')A,{x)}^ d'x dV (231) 

the integral being over all space for a long time T, the sum over the physical photon states only. It is not 
correct to take in (231) the photon states to be the 4 states with polarization in the directions = 1, 2, 3, 4, 

because these arc not physical states. 

Using a sum-rule to sum over the states 



^ = Tc^tfi 



^JJ E.?AA(^')iM(^) {Al{x')A,{x))^ d'x d^x' 

Write jxA{x') for the matrix element^^ of jxA{x') in the reverse transition 1^2. Then 

fxAix') = VxjxAix'), Al{x') = vxAxix') 

with Tlx = +1, A = 1, 2, 3; 774 = -1. 
Hence 



^ // E j*xA{x')3,A{x){Al{x')A,{x))^d'xd'x' 

X, ju=l 

j d'^k6ik'^)Q{ko)^j^Aik)j^Aik) ^^^j_2 j j dxo exp{i(x - xo)(q - fco)} 



1 r 

E j d'kSm' - q^)j,A{k)3,A{k) (232) 



smce 



^^^^ 2cT 





where^° 



( [ dxodx'Qeic^{i{x — XQ){q — ko)} = cT — — ^ ^ >Tr6{q — ko) if cT —> oo 



j^A{k) = J jMr) e-'" "- d^r j^A{k) = j ~j^A{r) e^'^ '' d V 
By the charge conservation law 

E ^i^jf^A {k)=0 E ^i^ji^A (fc) = 



and so 

^3 1 ^ 
^jnA{k)j^,A{k) = —qjiA qjiA + E = 2 E kdiAik)} + E \jiAik)f 



= cos^ + li^p = |j^|2(l - cos^ e) = IhAf + \j2Af (232a) 
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where 1 and 2 are the two directions of transverse polarization. This shows how the third and fourth 
polarization directions do not appear in real emission problems. The same result would be obtained if we 
used the indefinite metric explicitly, i.e. take the sum in (231) over the 4 polarization states /i = 1,2,3,4, 
with the /Lt = 4 given a minus sign arising from the rj in (220c). But it is always simpler to work directly 
with the covariant formalism, than to bother with the non-physical photon states and then have to use r] 
to get the right answers. 

Finally, the emission probability in direction of polarization 1 and in direction of propagation given by 
the solid angle dCl is, using (232), (232a), and S{\k\'^ - q^) = ^5{\k\ - g) for g > 

- = ^ Wl' (233) 
For dipole radiation by a one-electron atom with coordinates (x, y, z) 

ji = ex = iecqx fe • r <C 1 

and 

e'^q^dn 2 

This checks with Bethe's Handhuch article. 

The example shows how covariant methods will work, even for problems of this elementary sort for 
which they are not particularly suited. The covariant method avoids the necessity of having to think about 
the normalization of the photon states, the factors of 2 and tt etc. being given automatically when one uses 
(221). 



The Hamiltonian Operator 
Prom the equation 

we find 



in^^ = [A„ H] 



[ttk^, H] = hc\k\aktj. 
[akfj., H] = -hc\k\ak^i 

Using the commutation rules (213) we can find an operator H which satisfies all these conditions simulta- 
neously. Namely 

r * 

H= d^khc\k\Y, akxakx (234a) 
1 

This operator is in fact unique apart from an arbitrary additive constant. To fix the constant we require^^ 
= which leads to the result (234a) precisely, as one can see at once from (219). Hence (234a) is the 
Hamiltonian of this theory, which is very simple in this momentum representation. 

To derive H from the Lagrangian is also possible but much more tedious. Prom (234a) we see that 

-^fcA = OfcAflfcA (not summed) 

is an operator just representing the number of quanta in the frequency k and polarization A. It follows at 
once from the commutation rules (213), from the singular (5-function factor which comes from the continuous 
spectrum, Nk\ being in fact the number of quanta per unit frequency range, that J Nkxd^k integrated over 
any region of momentum-space has the integer eigenvalues 0, 1, 2, ... . This is so, because the state with 
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rii particles with momentum fcj is = ni=i(^feiA)"'*o- Then taking J^N^xd^k over Q including the 
momenta ki,k2, ■ . ■ ,kj we get 

i 

Nkxd^k^= / dkxakxT[{ak,xT''^od^k 

^ in ^^-^ 

= [ akxiY.niY[a'Q{-^\akx,dk,x]+Yliak,xT'akx\'i>od^k by (213) and (217a) 

I £ j £ 3 



/ 



«=1 2=1 i—1 i—1 i=l 



Fluctuations of the Fields 

Since the electromagnetic fields E and H are quantum-mechanical variables, they do not have well- 
determined values in any state in which energy and momentum are well-defined, for example the vacuum 
state. A state of the fields can be specified either by fixing the values of E and H or by specifying the 
numbers of quanta present with various momenta and energies. The two descriptions are complementary 
and are both possible only in the classical limit of large numbers of quanta and very strong fields. 

An educational discussion of these questions, considering in detail the example of a cavity resonator with 
one mode of oscillation, has been given by L. P. Smith, Phys. Rev. 69 (1946) 195. This is worth reading. 
Essential is the fact that you cannot fix the time-dependence of the field (phase) with a fixed number of 
quanta (energy). 

Wo consider a more general problem. What is the mean-square fluctuation in the vacuum state of a 

field-quantity? We define 

E,{VT) = ^ I^^E,{x)dTdt 

H,iVT) = -^f H,{x)dTdt (235) 

averaged over some finite space-volume V and also over a time T. Let V{k) = Jy e"'*' '' dr. Then since 
iJ = V X A, we have 

{HiiVT)f)^ = ^ J J drdr'dtdt' - ^M) [j^^A', - ^A'^ 

1 he f f , ,,,,,/ 9 d 



Y2J<2 2 

1 he 
V2T2 16^ 

he 



167r3y2r2 



jj drdr'dtdt' (^^ + ^^)^«(--') using (225) 

/ / drdr'dtdt' + -^^) I d^fce*(^-"')5(fc2) using (226) 

J J ydx^dx'^ dxzdx'^J j 

r r f ^(fc2 + fc^)|y(fc)|'e^l'=l^''e-^l'=l^orfidt' 
Jo Jq J \'^\ 



he f d^k 2 , 2n m2 



167r3y2y2 J |fe| ^ 2 ' a/ I V VI ^.2|^|2 

he fd^k..,.2 ,2mta/,m2 4sin^(5c|fcT) / r „ r',rrr.^^2 



167r3y2T2 



(236) 
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Taking for V any finite volume and T a finite time, this mean-square fluctuation is finite. Example: a 
sphere of radius R gives 

V{K) = (sini?|fe| - i?|A;| cosi?|A;|) (237) 
|fe| 

But if either R or T tends to zero, the fluctuations tend to oo and in the limit actually diverge. That 
is to say, only measurements of fleld-quantities averaged both in space and in time can have any physical 
reality. 

Fluctuation of Position of an Electron in a Quantized Electromagnetic Field. The Lamb Shift. 

Consider an electron represented by an extended spherical charge of radius R, lying in a stationary state 

in the potential (j){r) of a hydrogen atom, ft has a certain wavc-fimction ilj{r). We consider everything 
non-relativistically except for the quantized radiation field with which the electron interacts. The effect of 
this fluctuating fleld is to produce a rapid fluctuation in position of the electron. In fact we have for rapid 
fluctuations 

mr = —eE 

Thus a fluctuating component of E with frequency c\K\ produces the same fluctuation in r with amplitude 
el 

miiltiplied by a factor . The slow fliictuations of E cannot be followed by the electron if the 

frequency is less than the atomic frequency cKh- Hence we find from (236), making T 

he r°°d^K,„^ o 1 



because lim — 5— = 1. The integral now converges at 00, because of the finite size of the electron. Since 



R is very small we may approximate (237) by 

V{K): 



|7ri?3 = V for \K\R < 1 
for \K\R > 1 



Then, since {K^ + K^) = \K\'^{1 - cos^ 6*) and /J" sin^ i9 sin 6* ^6* = | , we have^^ 

^ 2x _ fi'ft f^'"- d\K\ _ e^h / 1 \ 

Vi/o ^ 6m2c37r2 J^^ \K\ ~ Qm^c^n^ °^ \RKh) ^ ' 

This fluctuation in position produces a change in the effective potential acting on the electron. Thus 

{V{r + 5r)) = V{r) + {6r • VV{r))^ + i {{Srf)^ ^ + " • • 
= V{r) + ^{rl)V'V 

because {6r ■ yV{r))g = 0, being odd. Now in a hydrogen atom, \/^V = e^6^{r) (Heaviside units!) Hence 
the change in the energy of the electron due to the fluctuations is^^ {co = Bohr radius) 



AE = j tp* SVxpdT =-\{rl)^e' 



■ log I 1 — for s-states 



= { 127r2m2c3 ^ \RKh J T^n^al (239) 
for all others 
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because for the hydrogen atom'^® (p — r^/SnirEn/ h) 



and 



2-K 



(2^+ |m|)! 



m 



1/2 



Pj'"l(cos^) 



2 



(n-£ - 1)! 



-1 1/2 



2n[(n ■ 



yi3 



V'nO0(0,^,<p) 



1 ^/2 



2 



(n- 1)! 



-I 1/2^ 



2n(n!)3 



(n!) 



M2 



There will also be a (much bigger) addition to kinetic energy, arising from the fluctuations. We ignore 
this on the grounds that it will be the same for all atomic states and so will not give any relativistic 
displacement. Of course this is not a good argument. 

Hence we find the first approximation to the Lamb shift; the 2s state is shifted relative to the 2p states 

by 

A£ = TT^T-^ log ■ 



RKh 



Now37 



J ~ ~~ (Bohr radius) 



Ry = 

Kh 



4 

em 



a mc 

(Rydberg energy unit) 



327r2^4 

Ry 

We take R = (h/nic), the electron Compton wave-length since it is at this frequency that the NR treatment 
becomes completely wrong. Then 

1 



AE = +— log(8 X 137^) Ry 

OTT 



(240) 



a 



Actually — Ry = 136 Mc in frequency units. This gives an effect of the right sign and order of magnitude. 
37r 

Method due to Welton [14]. 

The size of the log is wrong because the low-frequency cut-off was badly done. We find AE ~ 1600 Mc 
instead of the correct value of 1060 Mc. But physically the origin of the shift is correctly described in this 
way. 



la. Theory of Line Shift and Line Width 

To make a better treatment of the effect of the radiation interaction on energy-levels, we must try to solve 
more exactly the equation of motion for the system atom plus radiation field. The effect of the field is shown 
not only in a shift of energy levels, but also in a finite width of the levels due to real radiation. Roughly, if 
the state has a life-time T for decay by radiation, the width F of the level, or the mean variation in energy 
of the emitted photons, is given by the uncertainty principle T w h/T. The line-shift and line-width are 
effects of the same kind and cannot be correctly treated except in combination. 

So we make a theory now which treats the atom non-relativistically, but takes account properly of the 
radiation interaction. This means, we repeat the calculation of spontaneous emission by an atom, but 
now including the reaction of the radiation on the atom instead of taking the atom as given by a fixed 
charge-current oscillator. 

For this sort of calculation it is always convenient to work in a special representation called the Inter- 
action Representation. 
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The Interaction Representation 

In the Schrodinger representation the wave-function \l/ satisfies the equation of motion 

d 

m—^ = (241) 
dt ^ ' 

where H is the Hamiltonian. In the case of an atom interacting with the radiation field, we have 

H = Ha + Hm + Hf (242) 

where Ha is the Hamiltonian operator for the atom, and Hm that for the Maxwell field without interaction. 
Hm is given by (234a), and in quantum electrodynamics according to (170) 

because j^{r) = ieifj^f^ijj^ and in this case = X);^ '""m^m ^ = because tt^ = dJ^/dA^, and ^/ 
does not contain A^. All the operators in (242), (243) are time- independent"^^, Schrodinger representation 
operators, and so are the given the label S. 

Now we choose a new wave function $(f) which is given in terms of ^ by 

*(0 = exp j--^ {Ha + Hm) t| $(i) (244) 

This $(t) will be a constant for any state representing the atom and the Maxwell field without interaction. 
Thus the time- variation of $(t) in an actual state describes just the eff'ect of the interaction in perturbing 
the atomic states. Prom (241) and (244), the time- variation of $ is given by the Schrodinger equation 

ih^ = Hi{t)^ (245) 

with 



Thus 



jtt(r,t) = exp< 




^i^(r)exp| 




A^{r,t) = exp| 


jTlMt^ 


j>^;:(r)cxp. 





Hi{t) = exp |-(ir^ + i/M)i| exp |--(ir^ + Hm)^^ (246) 
Hi = -- I ^ j^(r, t)A^{r, t)d^r (247) 

with 

(248) 
(249) 

These operators j^(r, t) and vl^(r, t) have precisely the time-dependence of the field-operators in the Heisen- 
berg representation, for the two systems, atom and radiation field, taken separately without interaction. 
Thus in the Interaction Representation the time-dependence of the Schrodinger wave-function is split into 
two parts, the operators taking the time-dependence of the non-interaction systems, the wave-function 
taking a time-dependence showing the effects of the interaction only. The operators (r, t) satisfy the 
wave-equation D^A^ = and the covariant commutation laws (203), because we see from (249) that 

^^l^=in[HM,A,{r,t)] 

i. e., the time variation of Af^{r,t) is the same as that of Afj,{x) in the Heisenberg representation without 
interaction (see (190a)), which in turn leads to the field equations (197). Matrix elements of IR operators 
given by (246), (248) or (249) between IR wave- functions given by (244) are of course the same as the 
matrix elements which would be obtained in any other representation. 
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The Application of the Interaction Representation to the Theory of Line-Shift and Line- Width 

Consider the solution of Eq. (245) in which the atom is given initially in a stationary unperturbed state 
O with energy E^, the Maxwell field being in the vacuum state, with no photons present. Let be the 
interaction representation wave-function representing the atom in state O and the Maxwell field in the 
vacuum state, without interaction. $o is independent of time. 

The initial condition = at time t = to is & physically unreal one. It would mean putting the 
atom into existence at time withoiit any radiation field being excited at this time. This wc cannot do 
physically. In fact the initial condition for an atom in an excited state will depend on how it got into 
the excited state. This cannot be formulated in a simple way: one needs a complicated model in order to 
describe the initial excitation of the atom. 

We are interested in calculating the variation with time of ($*<l>(f)), the probability amplitude for finding 
the atom still in the unperturbed state at time t. By (245) we have 

j^{Km) = -\{KHi{t)m) (250) 

Suppose we take the physically unreal initial condition 

$(t) = $o at i = io 

Then (250) will give 



37 {Km)t=u = (251) 



d_ 

dt 

from the Hj given by (247) has zero expectation value in the Maxwell field vacuum, since (r, t) also has 
a zero expectation value in vacuum as one can see from (211) and (217). Thus ($*$(t)) is momentarily 
stationary at t = to. This is however not interesting since the conditions at t ^ to are entirely unphysical. 

The physically meaningful quantity is the value of (250) at a time t a long time after to- Then the atom 
will have "settled down" to a quasi-stationary state of radiative decay, and we may expect that the value 
WC! find for (250) is independent of the particular initial condition chosen and will be correct for an atom 
which has been excited by any reasonable method to the state $o- 

We make the calculation so as to include effects of the radiation Hj up to second order. This means we 
include effects of emitting and absorbing one photon only. In fact we know physically that effects from two 
or more photons are very small, so the approximation is a good one. 

Suppose that {t — to) is long compared with all the atomic frequencies. Then a solution of (245), valid 
to first order in Hj, is 



$i(t) 



Hi{t')dt' 



a{t) $o (252) 



+ terms involving other atomic states with two or more photons present. 

Here a{t) = ($*4>(i)) is a slowly varying amplitude, constant to first order in Hj, representing the slow 
decay of the atom. Note that our treatment is not only a perturbation theory correct to second order in 
Hi, but it also must account exactly for the big effects produced by the radiative decay over long periods 
of time. Therefore wc do not put a(t) = 1 in (252) although this would be correct to first order in Hi. 

If we put a{t) = 1 in (252) we should have just the solution of the radiation emission problem, neglecting 
all effects of radiation reaction on the atom, which we obtained before from Eg. (230). 

The value of --r(^{t) = -r (^o^(O) obtained correct to second order in Hi, and including the radiative 

(J/Jj (JjJj 

reaction effects, by substituting (252) into (250). 
Hence we have ^ 

^ '^a{t) = -\[ dt' {^*oHi{t)Hi{t')^o} (253) 



a{t) dt 
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Using (247), (221) this givcs^^ 

ltt=-i-c fjt' II d'rd'r'D+{r-r',t-t')J2{Mr,t)Mr',t'))^^ 



(254) 

Let the atomic states be labelled by n, the state n having energy En- Let 

j^(n,m) (255) 

be the matrix element of the operator 

/i^We-"""'^'^ (256) 
in the transition m ^ n. Then using a matrix product to evaluate {jijt{r,t)jn{r' ,t')) 



J^/i? /^'^*'E^^p{^(*-*')(^o-£;„-fic|fe|)|Eli^(^.0)p (257) 



Ida 1 r ,1^1. 

a dt IQ-K^tic 



where we made use of (248). 

As before, the sum extends only over the two transverse polarizations /U, the other two cancelling each other 
exactly. Now we have to calculate 

/ e'°^ dx = 7r(5(o) + — = 27r5+(a) (258) 
J-oo «a 

this being the definition of the ^+ function. Thus 
We write 

i^ = -ir-iAi; (260) 

a dt 2 h ^ ^ 

Then AiJ and F are real constants given by 

h 

'16tt^cJ \k\ En - Eo + hc\k\ 
^=8^/ ^J2\j>'^)\'^(En-Eo + hc\k\) (262) 



A^ = -^/^V^^>i^ (261) 



These are independent of t. Therefore the amplitude of the state $o in the wave-function $(t) is given for 
all i > to by 

a{t) = (Km) = exp {-^A^ it - to) - (i - to) I (263) 

The state i>o, as a result of the perturbation by the radiation field, has its energy shifted by AE, and 
decays exponentially like 

|a(t)|2 =e-r(*-*o) (263/a) 
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Comparing (232) and (262), we see that F is exactly the total probability per unit time of radiation from the 
state o to all other states n, calculated neglecting radiation reaction. This gives the physical interpretation 
for the decay law (263/a). When the denominators in (261) have zeroes, the integration over is to be 
taken as a Cauchy principal value. The energy shift AE is exactly what would be obtained from elementary 
second-order perturbation theory, if the difficulties arising from vanishing denominators were just ignored. 

We calculate now the spectrum of radiation emitted in the transition from the level o to the level n, 
including effects of the level shifts AEg and AEn and the widths To and r„. Let bnk be the amplitude at 
time t, of the state in which the atom is in state n and the photon is present with propagation vector k. 
The equation of motion for bnk, including effects of radiation out of the state n, is 



dbnk 
dt 



|-ir„- ■^A£;„|&„fc-Qexp||(£;„-Eo + /lc|fe|)i|a(i) (264) 



with a{t) given by (263).^° Here the last term represents the effects of transitions o ^ n, and Q is the 
space part of a matrix element of Hf which is independent of t and varies only slowly with k so that we 
can regard Q as a constant for all values of k within the line-width. The exponential is the time part of the 
matrix element, the exponent being proportional to the energy difference between the atom in state n plus 
the photon, and the atom in the zero state. The solution of (264) is, taking for convenience to — 0, 

bnk = A {cxp(-/?f) - exp(-7t)} (265) 

using the initial condition bnk = at i = 0. Here 

/? = + ^ [Eo + AEo -En- hc\k\) 

7 = ^r„ + '-AEn (266) 

and A = Q/(/3-7). 

The probability that the atom leaves the state n by a second radiative transition at time t, so that a 
quantum k remains, is 

r„|5„fc(t)|2 

The quantum k remains from the first transition o n. After the atom makes the second transition to a 
continuum of possible states, the final states will no longer be coherent, and so the quanta left behind at 
different times t will not interfere with each other. The total probability for the emission of a quantum of 
frequency k in the first transition is thus 

Pik) = TnlQfj^^ \e-^' - e-^f dt (267) 

Now 

1 r 1 1 1 1 



e 



-/3*_e-^*|2dt = 



1 f 1 1 



/3 - 7 I (7 + r){P* + 7) (/3 + P*KP + r) . 

/? + /3*+7 + 7* 1 R-e(/3 + 7) 



(268) 



(/? + + j*){l3 + 7*)(7 + p*) 2 Re (/?) Re (7) |/3 + 7*|2 
Hence 

P(fc) = \QfE^±IlL ^ (269) 

r„ {Eo + AEo-En-AEn-hc\k\f + lh^ro + Tn)^ 

This formula for P(k) gives the natural shape of a spectral line. The maximum intensity occurs at 

hc\k\ = (Eo + AEo) - {En + AEn) (270) 
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i.e. at the difference between the energies of the two levels including the radiative level shifts. The width 
at half-maximum is 

HTo + Tn) (271) 

just the sum of the two widths of the levels given by their reciprocal life-times. 

These formulae (270) and (271) are important in interpreting the modern radio-frequency spectroscopic 
experiments with their very accurate measurements of line shapes and positions. 

Calculation of Line-Shift, Non-Relativistic Theory 

In all atomic systems, the line widths are finite and easily calculated from the known transition amplitudes. 
For this, non-relativistic theory is accurate enough for all purposes. The line-shift (261) is much more 
difficiilt, and non-relativistic theory is not accurate enough to handle it properly. Still we shall calculate 
(261) using the NR theory, to see what it gives. It turns out to give quite a lot that is interesting. 

First, in a NR calculation we use the dipole approximation which we also used to derive (234). Supposing 
a one-electron atom, the electron having mass m and charge — e, we put 

(nO) = -^{PiU = { / C ("'^^) ^°'^'''} ^^^^^ 



The line shift (261) becomes 



mn^m'^cj \k\ ^ En-Eo^nc\k\ 



and integrating over the direction of fe, (compare with (238)), 



= r \k\ d\k\ y ,,, (273) 

GTT^m^cJo ' ' ' ^ Er, - Eo + hc\k\ ^ ' 



The integral over |fc| is now obviously divergent, even before summing over n. Therefore the line-shift is 
infinite. When a complete relativistic theory with positrons is used, the divergence becomes only logarithmic 
instead of linear, but it still definitely diverges. This was for many years a disaster which destroyed all faith 
in the theory, and no way of avoiding the difficulty was found until 1947. 

The Idea of Mass Renormalization 

The line shift (273) is also infinite for a free electron of momentum p. In this case p is a diagonal operator 
and the sum over n reduces to the term n = 0. Therefore 

A^i' = -ii^^(f *i)p^ (™) 

The effect of the radiation interaction is just to give a free electron an additional energy proportional to its 
kinetic energy {p^ /2m). If the integral in (274) is cut off at an upper limit K ~ (mc/h) in order to allow 
for the fact that the theory anyway is wrong in the relativistic region; then 

67r^ he m 

is a small correction to the kinetic energy, which would be produced by an increase in the rest-mass of the 
electron from m to {m + 5m), 



Sm ■ 
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We must now take into account that the observed rest-mass of any electron, bound or free, is not m 
but m + Sm. Therefore in (273) a part 



d\k\ (276) 



1 e' 
67r^ 

expresses only the effect of the mass-change Sm on the kinetic energy of the bound electron; this part 
is already included in the kinetic energy of the electron, when the observed mass (m -|- 6m) is taken for 
the mass in the formula {p^ /2m). Therefore the part (276) has to be subtracted from (273), to give the 
observable line-shift. The subtraction just cancels out the error that was made in identifying the mass m 
of a "bare" electron without electromagnetic interaction with the observed electronic mass. 

The idea of this mass-renormalization is that, although the "bare" mass m appears in the original 
description of the atom without radiation field, all the final results of the theory should depend only on the 
physically observable m -|- 5m. The idea is originally due to Kramers [16], developed by Bethe {Phys. Rev. 
72 (1947) 339.) 

Subtracting (276) from (273) gives the physically observable line-shift 



■,2^2 



Jo ' En-Eo + hc\k\ ^ ' 



The divergence at high |A;| is now only logarithmic. Taking an upper limit cut-off for the integral at the 
point 

;ic|fe| = K 

where K is an energy on the order of magnitude of mc^, we have 

= 6^ ? - i^-i' ^'''^ 

remembering that the integration over in (277) is to be taken as a Cauchy principal value when (En — Eo) 

is negative. 

From this formula (278) the line-shift for hydrogen states can be calculated numerically, as was done by 
Bethe, Brown and Stehn {Phys. Rev. 77 (1950) 370.) 

Since the log in (278) will be quite large (^ 7) for states n which are in the non-relativistic range, it is 
convenient to write 



J2 {En - Eo) \p^f log \En -Eo\ = IY1 - ^o) \Pnof [ log {E - E^) 

n \ n ) 



(279) 



this being the definition of {E — Eg)^^. Then [E — Eg)^^ is a non-relativistic energy. Exact calculation 
gives for the 2s state in hydrogen 

{E-Eo),, = 16.6Ry (280) 

Thus the important transitions are to states which although non-relativistic are continuum states with very 
high excitation. This is surprising. 

Note that in (278) the terms are all positive if Eo is the ground state. For higher states there will 
be both positive and negative contributions. In particular, we shall see that for a Coulomb potential the 
positive and negative terms cancel almost exactly, for all except s states^^ . The cancellation is more or less 
accidental and seems to have no deeper meaning. 
Now using a sum rule 

Y,{En-Eo)\p^f = {p-[H,p])^^ (281) 

n 

where H is the Hamiltonian for the atom^^ 

^=i^' + ^ ^ = -^7' [H,p]=zhiVV) (282) 
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{P-[H, P] = h' I ■ (VoVl-) dT (283) 



ft2 

2 
2 



where we used Green's vector theorem, the fact that {p ■ [H, p])^^ is real, and the result that V^V = 

e'^6^{r) in Heaviside units. 
Hence*3 



av 

e'^h , K \l/(Trn^a^) for s states 

log ,^ X 1 . r (284) 



127r2m2c3 S^)^ 

Compare this with (239). It differs only in having the log{K/{E — Eo)a.v) replacing log(l/i?iirfl'). The low 
frequency photons have now been treated properly instead of being estimated. Only the high-frequency 
end is still inaccurate because of the vagueness of the cut-off K . Taking K = mc^, (284) gives for the Lamb 
shift 2s — 2p the value IO4O Megacycles. Remarkably close to the experimental value of 1062. 

The success of this calculation of the line shift shows that the correct treatment of the interaction 
between an electron and the Maxwell field, with the help of the idea of mass renormalization, will give 
sensible results in agreement with experiment. This calculation could be done non-relativistically because 
the line shift is mainly a low-frequency and non-relativistic effect. 

There are other effects of the radiation interaction, especially the anomalous increase in the observed 
magnetic moment of the electron by a factor + ^) ^^^^ value given by the Dirac theory, which are 
essentially relativistic in character. For studying these effects, and for calculating the Lamb shift ac;curately 
without an arbitrary cut-off, we need to use a complete relativistic quantum electrodynamics, in which both 
electrons and the Maxwell field are handled relativistically. 

Therefore we must go back to where we left off the theory of the Dirac electron on page 31, and start to 
construct a relativistic field theory of electrons and positrons, similar to the quantized Maxwell field theory. 



//. Field Theory of the Dirac Electron, Without Interaction 

We apply to the Dirac equation the method of field quantization for anti-commuting fields. The reason 
why we must do this, and not use commuting fields, we will see later. Write 

H = {mc/h), m = electron mass 

Lagrangian 

^o = -hc^{y,lx^+f^\i^ (285) 




Note the factor h here. This means that the theory has no classical limit in the sense of the Correspondence 
Principle. In the classical limit, only charges and currents composed of many particles have any meaning; 
the ijj field disappears entirely from view. The h has to be put into (285) to make the dimensions right, 
since (iptp) has dimensions (1/Volume) just as in the 1-particle Dirac theory of which this is an extension. 
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Field equations 



9V 

I 

The charge-conjugate field (f) can be defined by 



(l) = Cip+ 

according to (51), and it also satisfies 

(E7a^+m)^ = (287) 

Covariant Commutation Rules 

We proceed as for the Maxwell field. Take two points z and y with zq > yo. Let 

Q{y) = i'{y)u 

niz) =vip{z) or li){z)v (288) 

Here u and v are spinor operators, not depending on j/ or z and anticommnting with all the V" ?/' 
operators in our equations, as we assumed at the beginning of this section. For example take u — ij}{w) 
where w is a point far away outside the light-cones of both y and z. We make a change in the Lagrangian 
by 

5Q^ = e5\x-y)i,{y)u (289) 

The factor u must be put in to make 5q^ a bilinear expression, which is necessary for the Peierls method 

to be applicable. In fact only bilinear expressions have a physically observable meaning, and it is never 
meaningful under any circumstances to add together a term linear and a term bilinear in the field operators. 
The changed field equations for ^ and tf} are 

For if) : No change 

\x-y)u = (290) 



Thus 5qiI){z) = and edQiljiz) satisfies (290). (Compare with (198).) 
Hence 5qiI}{z) is defined by the conditions 



(Eta^+/^) 5ai'{z) = l^nz-y)u 



(291) 



6q'^{z) = for zq < yo 
From (291), SqiP{z) is a c-number spinor. We write 



6Q^Piz) = -^SR{z-y)u (292) 
he 



Then Sii{x) is a c-number Dirac matrix function of x, satisfying 



(293) 



Sh{x) =0 for xo < 
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and the 4x4 unit matrix I is understood on the right of (293). 
If we have TZ = tjj{z) v then 6tjj = 0, as before. 1{TZ = vtjj{z) then 

Sni^iy) = -^vSa{z - y) 

Sni^iy) = (294) 
where Sa{x) is the Dirac matrix satisfying 



5^(2;) = for xo > (295) 



Finally, if wc had chosen Q — mp{y) wc should have had SqiP{z) = in the same way. 
Hence using the Peierls commutation law (194) with (292) and (294) we find 

[vtp{z), uipiy)] = [ip{z)v, ijj{y)u] = 

[vip{z) , ip{y)u] = -iv[SA{z - y) - SrIz - y)] u (296) 

These hold for every choice of u and v if we now choose u and v anti-commuting with all the V and V 
operators. Therefore writing 

S{x) = Sa{x) - Sr{x) (297) 
we can write down the commutation rules for the operator components: 

bl>a{z), i^M) = ii^ai^), i>f){y)) = o (298) 

{Va(^), i^piy)] = -iSa/siz - y) (299) 
The invariant 5— function satisfies by (293) and (295) 

There are no simple formulae for the S'-functions like (261) and (265) for the D-functions, in coordinate 
space. However in momentum representations the 5-functions are equally simple. 

Momentum Representations 
Write 

Sr{x) = (^X^7a ^ - ^r{x), Sa{x) = ^X^7a ^ - ^a{x) 

(E7.^-A*)a(.) (301) 
Then we have 



S{x) 



(□2 - ^2) Ar{x) = (D^ - ^2) A^(a;) = -5\x) 

(□2 - n^) A(x) = A(:r) = {Aa - Ar) (x) (302) 

with the boundary conditions as before. And the A-hmctions arc exact analogues of the D-functions, the 
D-functions being the special case /x = 0. Instead of (207) we have by making the formal substitution 
fc2 ^ + fj,^ 



EXAMPLES OF QUANTIZED FIELD THEORIES 



62 



where the contour in the fco-plane goes above the two poles at ko = i-^/jfep + jj,'^. Similarly for (208). And 
instead of (210) 

A{x) = - J e'"-"^ 5{e + m') e{k) d^k (304) 
Hence using (301) and the notation (110) 

S{x) = I e'^-'' a + iij.)S(k'^ + jjL^) eik) d'^k (305) 

[2ny J 

Note the projection operator A_|_ appearing here, as defined by (115) with the momentum p = hk. Thus 
the S-function distinguishes automatically between the electron states ko = +^J\k\'^ + fi'^ and the positron 
states ko = —^/\k^+|J^ 

Fourier Analysis of Operators 

We analyze ipa into Fourier components, written in a quite general form: 

where, as in (211), the factor (/x^/(|fep + ji"^))^^^ only makes the notation simpler. The integration is over 
all 4-vectors k with ko = +-\/|fep + /x^. Por each k, the sum J2u+ spin-states u satisfying, 

by (111), the equation 

0-ifj,)u = O (307) 
and the sum J2u- o^^'" spin-states u satisfying by (112) 

+ ifi) u = (308) 

the normalization being given by (106) and (113). The bku are operators independent of x and a, whose 
properties are to be determined. 

Taking the adjoint to (306) we have 

V^„(x)=Q J ^'fc( |fc/ o ) ^ lY.^luUae-'"-- +Y.^luUae"'A (309) 

Here the are ordinary Hermitian conjugates of 6fe„. 

Computing the anti-commutators (298), (299) from (306) and (309), and comparing the results with 
the momentum integral (305), using (115) and the properties of A+ we have 

{6fc„,6fc-„} = {feL,&fc'J=0 (310) 
6k} = '5'(fe-fe')'5„. (311) 

and we find that the constant Q in (306) and (309) is given by*^ 

Q = (27r)-3/2 (312) 

Emission and Absorption Operators 
Let 



Ek = hc/W+Ji^ (313) 
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be the energy of an electron or a positron with momentum hk. We apply the same argument which led to 
(215) for the Maxwell field. Prom this it follows that 

bku for electron states u 
for positron states u 

have matrix elements only for transitions from an initial state of energy Ei to a final state of energy E2, 
where 

Ei-E2= Ek (314) 

And 

bku for positron states u 
for electron states u 

have matrix elements which are non-zero only when 

E2-Ei= Ek (315) 

Thus we see as before that the field has the properties we require of a quantized field. It can carry energy 
only in discrete lumps of magnitude Ek for each frequency k. And the energy can be carried by two kinds of 
excitation, which we have called electrons and positrons anticipating the later results of the theory. We see 
already that these two excitation states have particle properties, and that there are two kinds of particles. 
The absorption operators are 

bku for electrons 
bku fo'" positrons 

the emission operators are 

bku for positrons 
b%u for electrons 

Charge- Symmetrical Representation 

We use the charge-conjugate field </> defined by (51), in order to put the whole theory into a form where there 
is complete symmetry between electrons and positrons. This symmetry is known as the charge-symmetry 
of the theory. 

Let k be given, and a spinor u satisfying (308), representing a positron state. We represent the positron 
state alternatively by the charge conjugate spinor 

V = Cu+ (316) 

which will satisfy (307) as u does for electron states. We denote by 

^kv = Hu 

the absorption operator for the positron state v. Then instead of (306), (309) we may write the pair of 
equations 

Mx) = qJ d'kl^j^^y' I^J2^^e^''''^^v + J2{^^^^ (318) 
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The tp and fields are thus entirely symmetrical between positrons and electrons; could be taken as the 
starting-point and tp derived from it, just as easily as vice versa. 
The commutation rules (311) become 

{hu, bl,^,} = S^{k-k')Suu' 

{bZ,blg,}=S'{k-k')S,,, (318a) 
{C',&L}=0, etc. 

These are also symmetrical between electron and positron. 

The Hamiltonian 

The Hamiltonian H has commutation rules with the emission and absorption operators as for the Maxwell 
field. These rules come straight from the Heisenberg equations of motion for ^ and (j). For any electron 
state u or positron v 

[bku, H] = Ekbku [bkv, H] = Ekbk^ 

[Ku,H]=-Ekblu [bl'^,H] = -Ekbl^ 

Hence the Hamiltonian of the theory is 



H 



fd'kEjj2 ^kuhu + ^Ivbg \ (320) 

•' [ u+ v+ J 



as one can verify at once by substituting into (319). 

The additive constant is again chosen so that (H)^, the expectation value of H in the vacuum state, is zero. 
This eliminates a possible arbitrary additive constant from H. 

In (317), (318) and (320) there is complete symmetry between electrons and positrons. The theory could 
just have well been built up from the positron as the fundamental particle, instead of from the electron. 

But for practical calculations wc shall generally not use (317), (318), (320). It is generally easier to work 
with the unsymmetrical form of the theory, with the fields tp and ^. 

Failure of Theory with Commuting Fields 

Suppose we had taken the theory up to this point, only assuming tp and %p to be ordinary commuting 
fields. Then u and v would be taken to be quantities commuting with all tp and tp, in the relations (296). 
Thus (298) and (299) would still be true only with commutators everywhere instead of anticommutators. 
Likewise (310) and (311). However in this symmetric representation, instead of (318a) we should have 

[ bku, bl,^, ] = 6^{k - k') 6uu' 
[bg,blZ,] = -5\k-k')S.,, 

And then the Hamiltonian instead of being given by (320) would have to be 

H = I d^kEAY^bl^bku-Y. &l I (322) 

So positrons would actually be particles of negative energy, like the negative energy electrons of the one- 
electron theory. This is physically inadmissible. 

Thus the use of anticommuting fields is the only thing which gives us a proper positive energy for the 
positrons. This is reasonable, because the intuitive Dirac hole theory can only work by virtue of the Pauli 
exclusion principle, and the exclusion principle is a feature of anticommuting fields. 
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The Exclusion Principle 

Take any creation operator As a special case of (310) wc have identically 

KuKu = (323) 

Given any state 5*, the result of creating two electrons with frequency k and spin ?i in that state is 
^fcM^feii* = 0. Thus there are no states in which two electrons have the same momentum and spin. So we 
have the Pauli exclusion principle valid for both electrons and positrons. Also, an electron and a positron 
do not exclude each other. 

It is a very great success of the general field theory, that it has given us the Pauli principle automatically, 
not by special hypothesis as in the old particle theory of electrons.*^ 

The most general state of the fields is described by specifying for each electron and positron state the 
number of particles occupying it. This number in each case can take just the two values and 1. 



The Vacuum State 

The vacuum state is defined by 

bku^o = and thus = for electron states u 

Ku^o = and thus bku = for positron states u (323a) 

So the vacuum expectation value of products of emission and absorption operators are given by (311). We 
find using (323a) 

{bkubk'v)o = {bkubk'v)o = 

ihubU), = Qu S^{k - k') 5uv (324) 
{Kubk'v)o = {'^-®u)5^{k-k')5uv 

where 

Q I = 1 for electron states u, 
" 1 = for positron states. 

Hence by (306) and (309) the expectation value (^ijjaiz)^' fjiv)) ^ is just the part of the anti-commutator 
{■ipa{z), ^fjiv)} which contains positive frequencies exp[ifc • [z — y)], with ko > 0. Thus, similarly to (221) 

{Mz)i;i,{y))^ = -iS+p{z-y) (325) 
S+{x) = J e^'^-" + in) 5{k^ + n^) e{k) d^k (326) 



where 



writing as before 



e{x) = 



+1 for cco > 
: for a;o < 



5+ = i (5 - i5(i)) 5- = i (5 + (327) 

{i^^{y)M^))^ = -tS-^{z-y) (328) 

S^'\x) = J e''^-^ + ill) + /i') d^k 



(330) 
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S-{x) = J e''=-^ + ill) S{k^ + m') e(-A:) d'^k 



(330a) 



These resuhs for the Dirac theory without electromagnetic interaction will be used a lot when we come 
to the complete quantum electrodynamics with both Dirac and Maxwell fields quantized. Meanwhile, we 
should say a little about the theory of quantized Dirac particles in a given c-number Maxwell field. 

Field Theory of Dirac Electron in External Field 
Lagrangian 

^ = ^D- i^A^'>l> (331) 

Field equations 



a|; + |^SI+».}* = (332) 



i:(£-£^5^)7.-.* = (333) 
These equations are still linear, the being given functions of position. This makes the theory still simple. 

Covariant Commutation Rules 

Because of the linearity, a change vtjj{z) in ^ makes no difference to the field equation for ipiy)- Therefore 
for every two space-time points y and z we still have, as in (298) , 

{Mz), My)} = {V^aW, = (334) 

Beyond this point not much can be done with the theory for general time-dependent potentials A^. 
In practice when we have time-dependent A^ we always use perturbation theory starting from free field 
formalism, assuming the A''^ to be small, or else use special tricks for particular problems. 

The important practical cases in which the are not small are always those in which the A® are 
time-independent in a particular Lorentz system. Examples: electrons bound in atoms by static Coulomb 
forces, electrons moving in constant macroscopic electric and magnetic fields. 

So we assume A'^ = A'^{r) time-independent. We also assume the A^ to be physically well-behaved so 
that the stationary eigenvalue equation 



3 



d 



En^n = <( -e$ + ^ + ""^^ ' + '^''''^ ^ ^" ^^^^^ 

regarded as an equation for the c-number Dirac wave- function i/^nir) has a complete set of eigenfunctions 
tpn with eigenvalues En, the spectrum may be either discrete or continuous or mixed. Equation (335) is 
derived from (332) by substituting into it the particular function 

'^i' = '^i'n(r-)exp|-i^f| (336) 

We assume further the potentials to be such that the eigenfunctions tpn separate clearly into two classes, 
the tpn+ with positive i?„ and the ipn- with negative E^- This is again true for all physically occurring 
potentials, though it would fail for the Coulomb field of a point nucleus with charge Z > 137. 

The time-independent potentials make the problem essentially non- covariant, and so we shall use the 
non-covariant notations in developing the theory. We write the equations as if all the levels n are discrete, 
thus J2n nieans a sum over discrete levels plus an integral over continuous levels suitably normalized. We 
are now chiefly interested in discrete levels, and so we need not bother to write the formulae for normalizing 



EXAMPLES OF QUANTIZED FIELD THEORIES 



67 



the continuous levels explicitly. This makes the picture apparently simpler than the free-particle theory, 
where the normalization of continuous levels was done at every stage accurately; the simplicity is only 
apparent, because we shall just be ignoring the complications arising from continuous levels. 
The general solution of the field equations (332) is 

ij{r,t) = J2^nMr)e^pl-i^t\ (337) 

where the 6„ are operators independent of r and t, and can be both positive and negative. 
By (334) we have 

{bm, K} = {b*^, bl} = 

where b'^ is the Hermitian conjugate of bm- We take the tjjn normalized so that 

rmir)Mr)d^r = 5nm (338) 



i.e. in the usual non-covariant way. Note especially in (339) that we are treating all levels as if they were 
discrete; this can be done for example by enclosing our whole system in a finite box. 

We want still to find the continuous rules between ip and tjj*, or between 6„ and Suppose we add 

to the Lagrangian (331) 

6^{r, t) = eS{t - to) V'*(r, to) Mr) u (339) 

where u is an operator anticommuting with il) and ij)* as in (288). This produces a change in the field 
equation for which now becomes 

{Eta H- 1^1) ^.y^^Sit- to)PMr)u (340) 

So the change Sip produced in tp by the addition of SJf satisfies (341) with the initial condition 5ip{r, t) = 
for t < to- (Compare with (198) and (290).) Now the solution of (341) will obviously be of the form 

Sip = a{t)iPn{r) (341) 

where a{t) is a function of t only, because the right side of the linear equation also has this form. Substituting 
(342) into (341) and using (335) we have 

- En] a{t) = -€S{t - to)u (342) 



and hence 



as we can verify using 



dt 



S^='le{t-to)ipn{r)expl-i^{t-to)}u (343) 



leit-to) = 6{t-to) 



Integrating (340) over space-time gives by (339) 

c//*^,M).V., = „*;exp{.f..}. ,344) 

For t > to, (343) is the change produced in ip{r, t) by adding (344) to the action integral. Hence by the 
Peierls commutation rule, using (193), (343) and (344), 



[b^u,ip{r,t)] = -V'„(r)exp|-i^i| 
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and therefore 

^ ^ ^ (345) 



because by assumption the u's anticommute with the ^'s. The time to no longer appears in (345), checking 
the consistency of the method. 

Multiplying (345) by V'n(^') exp{—iEnt'/h} and summing over n, we have^'' 

{Mr,t), rfi{r',t')} = J2^na{r)rnf3{r')expl.-z^{t-t')\ (346) 

This is the general commutation rule which reduces to (299) in the special case of free particles. 
Multiplying (345) by tpmi^) ^'^^ integrating over r, we have 

{bm, b*J = 5nm (347) 

which is identical to (311) in the case of free particles when normalizations are properly treated. 

The Hamiltonian 

As before, the 6„+ are absorption operators for electrons and the 6*_ absorption operators for positrons, 
only the electrons and positrons being defined by the bound wave- functions. The vacuum state is given 
by 

6„+*o = 6:-*o = (348) 

And the total Hamiltonian of the system, in order to give the correct commutators with the bn and the 6* 
and also to have zero expectation value in the vacuum, is 



H = J2 Enblbn - J2 ^nbnbl (349) 

n+ n — 

= Enb^bn + 5^ \En\bnb*^ (350) 

n+ n — 

From this Hamiltonian it is clear that the system is just a superposition of non-interacting particle-states. 
In each particle-state independently there is a number of particles given by 

Nn = b^bn for electron states 
Nn = bnb^ for positron states 

From the commutation rules (338) and (347) 

= N„ (351) 

so each has just the two eigenvalues 0, 1. This describes exactly the physical situation in a many-electron 
atom where each atomic level may be full or empty independent of the others. 
If each A'' is represented by the (2 x 2) diagonal matrix 

Nn=(l (352) 



then we have 



, ,'0 1\ fO 



oj \^1 

= (; o) <- - (o ;) (-^) 
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This gives an explicit matrix representation for the operators. Each of the states n has its own two-valued 
row and column index. Thus for an atom with M levels altogether the operators would be represented by 

(2^ X 2^) matrices. 

Once having got the Hamiltonian (350) and the stationary states tpn the theory of many-electron systems 
is completely straightforward. We see that the levels of the hydrogen atom given by the one-electron Dirac 
theory arc still exactly valid in this many-electron theory. Only now the Hamiltonian (350) having positive 
eigenvalues, the negative energy states give us no disasters. The positrons appear with positive energy, so 
that all the results of the Dirac theory are given to us simply and automatically. 



Antisymmetry of the States 

We know that in elementary quantum theory of many-electron systems we have to represent the system 
wave-functions by determinants of one-particle wave-functions, so as to make the system wave-functions 
always antisymmetric in the particle coordinates. Wc shall no longer need to make any such arbitrary 
choice of wave-functions in the field theory, all the results of the antisymmetry being given automatically 
by the theory. 

For example, consider an atom with 2 electrons in states i/ji and ip2, all other states being empty. Then 
the state of the system is given by 

* = blbl^o (354) 

where is the vacuum state. In (354) there is no arbitrariness; an interchange of the indices 1 and 2 will 
only change ^ into — ^ which means no physical change. Now consider a 2-particle interaction operator 

^=hjj '^'^i ^'^2 {rinmri)} V{ri - r2) {V*(r-2)V'(r-2)} (355) 

For example, V may be the Coulomb potential between two electrons, which is not included in the La- 
grangian (331). The ^ is put in so as to count each pair of points ri, r2 only once. We calculate the matrix 
element of V for a transition from ^ to & state 



where the 2 electrons are in two other states xl^^ and ^/'4. This matrix element is 

M={^lMhVblhl^o) (356) 

Expanding V by (337) into a sum of products of 6„ and 6* , a contribution to (356) will come only from 
the 4 terms in V proportional to 6i 62^364. Using the anticommutation rules, we have 

Mh2Vhlbl^) = -1 
{^'*Mb2Vblbl^) = l, etc., (357) 

Hence adding up the 4 terms 

M= // (iVidV2V^(ri-r2){{V'3(^i)^i(^i)}W(^2)V'2(r2)} 



(358) 

- {V'3(n)V2(ri)}{V:(r2)V2(r2)}} 

This is exactly the result, direct minus exchange interaction, which would be given by using antisymmetrized 
wave- functions. 

The field theory thus gives the full force of the Fermi statistics for electrons. And we could have shown 
in the same way that it gives the Bose statistics for photons. 
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Polarization of the Vacuum 

Because of the possibility of exciting the vacuum by creating a positron-electron pair, the vacuum behaves 
like a dielectric, just as a solid has dielectric properties in virtue of the possibility of its atoms being excited 
to excited states by Maxwell radiation. This effect does not depend on the quantizing of the Maxwell field, 
so we calculate it using classical fields. 

Like a real solid dielectric, the vacuum is both non-linear and dispersive, i.e. the dielectric constant 
depends on the field intensity and on the frequency. And for sufficiently high frequencies and field intensities 
it has a complex dielectric constant, meaning it can absorb energy from the Maxwell field by real creation 
of pairs. 

We calculate the dielectric constant only in the linear region, i.e. assuming weak fields. The critical field 
for this problem is 

= « IQie ^ (from eE- — ^ mc^) (359) 

eh cm mc 

and in fact the linear theory is good enough for almost all problems. The important case where it is 
not good enough is the propagation of photons through the intense Coulomb field round a heavy nucleus 
like lead. Then the non-linearity produces a scattering of photons which is small but has been detected 
experimentally by Wilson. [15] 

We calculate the dispersive effects exactly, i.e. with no restriction on the frequency. Since the treatment 
is linear, the imposed Maxwell field may be supposed to be given by the potentials of a plane wave whose 
amplitude increases slowly with time 

A'j^{x) = exp{ig • x + SoXo} (360) 

Here e and q are given vectors, 5o is a small positive number. This exponentially increasing amplitude 
is put in so that the potential efi'ectively acts only for a finite time before any given time at which 
observations may be made. This enables us to fix the initial conditions of the problem unambiguously. At 
the end of the calculation we shall go to the limit i5o = 0. 

The vacuum polarization is the effect of the fiuctuations of the quantized electron-positron field on a 
given Maxwell field. The Lamb shift is the effect of the fluctuations of the quantized Maxwell field on a given 
electron. The two effects are just opposites of each other, the roles of the two fields being interchanged. 
Thus we can treat the vacuum polarization now conveniently with the theory of the quantized electron field 
alone. The treatment will be relativistic and so more correct than the treatment given for the Lamb shift. 
Later, to have a complete theory of both effects, we shall quantize both fields together and consider the 
reaction of each on the other. 

Historically the electron self-energy (Lamb shift) and the vacuum polarization were the two problems 
on which the theory broke down because of divergences. Schwinger showed that the vacuum polarization 
could be calculated and was finite, if one used the same kind of renormalization idea that made the Lamb 
shift also finite. 

The electron field operator tpH in the field (360) satisfies (332). Here i/jh is the operator in the Heisenberg 
representation. Now a solution of (332) correct to first order in A'^ is 

^h{x) = ^{x) + ^Jdx' Snix - x')4\x')ij{x') (361) 

Here Sr is given by (293), (301), and (303), and il>{x) is a solution of the free-field equation (286). In fact 
tp{x) is the field operator of the interaction representation, when the effects of are represented in the 
wave-function instead of in the operators. Using the retarded potential in (361) means that the unperturbed 
states are specified in the past, as the initial states upon which A'^ later produces effects. Thus the vacuum 
state defined by (323a) is the state in which initially no electrons or positrons are present. This is the state 
which we wish to study, and we call it \I/o- 

Using the interaction representation operators ipi^), is the vacuum state and remains so for all time; 
the physical state is initially but does not remain ^o- Using the Heisenberg operators ^Ph{x), \I/o is the 
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physical state for all time; it is initially the vacuum state but does not remain so. In the remote past as 
xo — > — oo, because Sr is a retarded potential tjjH{x) and tp{x) become identical. 

The expression (361) is useful because wc know how to calculate matrix elements of iJj{x) from the state 
\E'o, whereas the matrix elements of tpH do not have any simple form. We also need the adjoint equation 

^h(x) = ii(x) + ^ / dx'ii(x')4\x')SA(x' - x) (362) 

he J 

where Sa{x) is given by (295). 

The total current operator to first order in A/j^ is 

jnH{x) = -iectljff{x)'yij,'ipH{x) 

= j^.{x) + j J d^x' {1P{x)^^Sh{x - x')4"{x')ij{x') + lf{x')4\x')SA{x' - xh^i>{x)} (363) 

Here 

j^{x) = -iec ip{x)^^ip{x) (364) 
is the current operator of the interaction representation. The vacuum expectation value 

iKjf^ix) *o) = (jM(a^))o 

= -iec(V ip^{x)(jf,)pai^a{x)) 

\ 0/..p I Q 

= -iec ^{piij^ficc {ipis{x)ipa{x))^ (365) 

is given by (328) 

(j^(^)>o = -ec Spur {7^5- (0)} 

= J d'k6{k^+n')e{-k) Spur{7^[|*-i/.]} 

= / ^(^' + ®(-^) (366) 

This is a highly divergent integral and is mathematically meaningless. This is one of the difficulties of the 
theory about which one can argue for a long time. 

However there is no doubt that correct physical results are obtained by putting {jfj,{x))^ = simply. 
There are two good reasons for doing this. 

(1) Physical. {jfi{x))^, being the expectation value of the charge-current in the vacuum in the absence 
of all external fields, is known to be zero experimentally. Therefore if we calculated {jfj,{x))^ and fomid 
it were not zero we should simply define the current operator to be — {j^i) o' With this definition the 
expectation value would become zero automatically. 

(2) Mathematical. {i^i{x))^ as we have calculated it is a vector, each component of which is a number 
quite independent of the coordinate system. There does not exist such a vector invariant under Lorentz 
transformations, except for this zero vector. Therefore {jnix))^ = is the only assumption we can make 
which would keep the theory invariant. 

This is a simple example of a method which has often to be used in quantum electrodynamics. When 
a calculation leads to a divergent integral or a mathematically indeterminate expression, we use physical 
arguments or arguments of Lorentz invariance to find a definite value for the quantity we are unable to 
calculate. This is the reason for the great success of the covariant formulation of electrodynamics introduced 
by Schwinger. 

So using this principle we have by (328) 

OXar))„ = Jd^x' Spm{4\x')S-{x' - x^^^Snix - x') + 4"{x')Sa{x' - x)^^S-{x - x')} (367) 
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We use the momentum representation (303) for Sr. But instead of taking the contour along the real axis 
for fco we can use a path of integration running along a straight line parallel to the real axis a distance So 
above it. That gives the momentum representation 



-So 



(368) 



where So is any positive real number, S is the vector with components (0, 0, 0, Sq) and the integral is along 
the real axis. The poles of (368) in the fco plane are displaced away from the real axis and so the integrand 
is free from singularities on the path of integration. Similarly 



=+<5o 



Sa{x) 



(fc - z5)2 + 



,d^k 



(369) 



When using (368) and (369) we shall usually make (5o ^ after carrying out the integrations, so that the 
convergence factors e^^°^° will tend to 1 for every finite x. 
Thus the momentum representation of (367) will be 



{j^cHix))^ = - ^ (2^ J JJ ^^^^ d'^k2exp{iq ■ x' + i{ki - fca) • {x' - x) + SoXo}x 
X |spur{(^(|^i + 1/^)7^(1^2 + 
+ Spur{^(|^^ -i^ + i/i)7^(|^2 + ^m)}' 



^(fc^ + M')e(-fci) 
(fc2 +iSy+ 
(5(fc| + Ai')e(-fc2) 



(fci - iS)^ + ^2 

The integration of x' is immediate and gives (27r)'*(5'*(fci — k2 + q). Hence 



^iq-x+Soxo ^ 



X [ d^k \ Spur{^(|^ + i/f)7^(^ 



+ Spur{^(|^ - + i^J.)-ffj.{i^ + 

Now consider the function^^ 

F^(fc) = Spur{^(|^ + Ui)-f^i^ + d + i^ + ifi)} 



S{k^+H^)Q(-k) 
{k + q + iSY + 
S{{k + qf +^x^'}Q{-k-q) 



(f^+iS + 



(fc - i(5)2 + 



(fc2+/i2)[(fc 



iSY + /i^] 



This has poles at 4 points in the fco plane 



^0 = -go - iSo ± + q\ 



The integral in the expression (370) is just the sum of the residues at the two points 



49 



ko = -^/W+7 



ko = -qo - iSo - Vl^ + 9p + 



® / 








X 

® 


X 

® 



(370) 



(371) 

(372) 
(373) 



® fco = -qo -iS- VlfcTqF+7i^ 
(3) fco = -go ^Ak + qF+7 
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Hence 

U.Hix)), = ^^e^-^^+^o-" F^k) dk (374) 

where means a contour integral in the ko plane drawn as shown in the diagram, going from — oo to +00 
below the two poles (373) and above the other two poles, and including the upper semicircle of the ko plane 
at infinity. So long as > the poles are always clearly separated from each other. Now the calculation 
really starts, with the evaluation of this integral (374). This will be typical of all the calculations which 
have to be done in quantum electrodynamics using the modern methods. 



Calculation of Momentum Integrals 

Write = Fv{k) dk. Then is a vector function of the variables (the mass of the electron) and 

Q = q + i5 (375) 

Jv is certainly an analytic function of /z if (5 > 0, and it is an analytic function of Q if is large enough so 
that the poles (373) always lie to the left of the imaginary axis and the other two poles (372) to the right. 
Therefore we can calculate for large values of /U and 5 = so that Q = q. 

To simpUfy (371) we take ^ = and use Feynman's formula {Phys. Rev. 76 (1949) 785) 

which we can get at once simply from the definite integral of the RHS.^° Hence 

J. = ldz dk Spm{0 + iM)7.(l^ + ^ + ^/^)} {fc2 + ^2 + ,[2fc.g + g^)P 

^ I '^^Ij^ Spur{^(^ - + *m)7.(^ + (1 - z)f + ^M)} |^2 + ^2 + |^_^2)g2p (377) 

In the last step a shift of origin was made in the fc-integration replacing fc by (fc — zq). Again in (377) 

the poles in the k^y-iAanc for every z arc well separated by the imaginary axis, provided that is large. 
Evaluating the spur and dropping terms arc odd in k, and using (33),^^ Spur = 0, Spur 7^71/ = 4(5^1^, 
we get 

, e.(-fc^ -^^' + iz- z^)q^) + 2(e • fc) fc. - 2{z - ^^)(e • q) 
""■^ = V„ Ic {k^ + ,^ + {z^z^)q^V ^'"'^ 

the odd terms going out because we can now take the fco-integration straight up the imaginary axis from 

—ioo to ioo if we like. For the same reasons of symmetry we may replace 

c ' k ku by ^/c g-u bccBiUSG c * k ki/ ^ ^ ^ ^xkxky ^ Gi/ki/ki/ ^ "^^uk 

X 

in the numerator, and obtain finally 

This integral is still badly divergent. So we again use a physical argument to give a definite value to the 
most divergent part of it. The current operator both in interaction and in Heisenberg representation must 
satisfy 
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identically. Therefore (374) gives (since we now take 6 = 0) 



(381) 



which gives the relation 



Cdzf 
Jo Jc 



.lz.2 



dk — 

c {k^ + iJ.^ + {z - z^)q^y 







(382) 



This equation (382) is a guiding equation telling us that a certain divergent expression appearing in (379) 
is to be given the value zero in order to make physical sense. We are left with 



Ji, = ^{q^e^ - e - qq, 
For any positive A the integral 



'.) C dz{z-z^) [ 
Jo Jc 



dk 



C + /z2 + (^ - ^2)^2p 



Ia 



dk 



(383) 



(384) 



Ic {k^ + Af 

is convergent and can be evaluated by integrating for fco up the imaginary axis from —zoo to +ioo. This 
gives (see the Appendix below) 



Ia 



dkidk2dk^dko 



{kf + kl + kl + ^2 + A)3 

Hence integrating with respect to A 

1 



fc3 dk 
(F + A)3 



: dx 



(^ + A)3 



dk 



1 



Ic l(A2+A)2 (P+/x2)2 

this integral being also convergent. However 

dk 



TT i log ( — 



if 



(386) 



Ic {k-^+lJ?Y 
is logarithmically divergent for large k. Its value is 

krni 



2i7r log 



(387) 



where R is the logarithmic factor and is independent of q. 

Using (386) and (387) in (383), with K = + {z~ z'^)q^ , we get 

Jv = 8iT'^i{q'^e„ - e ■ q q„) i^^R - J dz {z - z^) log 



1 + 



(2 - Z2)(72 



(388) 



This is the analytic formula for J^^ valid for large /i in which case the logarithm is real. We make the analytic 
continuation to small values of /i by writing (g' + i^) for q in (388), treating as small and positive. Then 
becomes q^ — 2i5qo and the logarithm becomes 



log 



1 + 



{z — z^)q'^ 



+ < 



(z - z^)q'^ 
for ^ , > -1 

-t7re(go) for 5 < -1 
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Writing now z for 4(z — z"^) and using (374), we go to the limit do = and find^^ 



1 ??_ 1 



log 



ZTT , , /■" 



the last term being zero except when 



(389) 



(390) 



Now the external potential Af,{x) is associated with a classical external charge-current density 



d 



92 



dxvdxx 



= c{q'^e^-e-qq^}e"^'' 



(391) 



Hence (389) gives the final result, with a = 



1 



137 'i-irhc 



, (Heaviside units) 



2 1 

{ji^H{x))„ = -aj^E{x) \ —R - — 



^ zdz 



477 Jo Vl - -2 



log 



£1. 

4/x2 



4e(9o) 



(392) 



Appendix 



The four dimensional volume element is (see (385)) d£^id£^2d^3d^4 = dr in four dimensional polar 
coordinates. To show this, wc denote the surface of a p dimensional unit sphere by lo. Then the surface of a 
p dimensional sphere of radius R is Rp~^lo, and hence the volume element in polar coordinates is u)R^~^dR. 

To calculate the value of w, we compute the p dimensional Laplace integral in Cartesian and polar 
coordinates. We have 



J 



On the other hand 



and thus by comparison 



■ ■ / |- E '^^i^^2 ...d^p = [v^Y 

J = UJ [ 

Jo 



P dp = uj 



e p 



r(p/2) 



UJ = 



27rP/2 

r(p/2) 



For p = 4, we have r(2) = 1,0; = 27r2. For p = 3, we have r(|) = lo = = An, etc. 

2 v'^/^ 

Physical Meaning of the Vacuum Polarization 

We now discuss the various physical effects arising from the calculation of 

(j'^Hix))^ 

1) The result is completely gauge-invariant. This may be seen at once from (391): for if a gradient 
dA/dXi, is added to Af,, there is no change in j^^. 
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2) If the relation (382) had not been used in order to simphfy (379), we should have had an addition to 
{jvH{x))^ of the form K'e^ = KA^{x) (see (360)) where K is an indeterminate numerical factor containing 
the divergent integral standing on the left of (382). That is to say, an induced current proportional to the 
inducing potential. This would have destroyed the gauge-invariance of the result \i K ^ Q. Therefore we 
can also use the physical requirement that the results must be gauge-invariant in order to give the value 
zero to the indeterminate K. 

3) The energy density of the vacuum arising from the polarization by the potentials A^, (x) is 



(393) 



Thus the term KA1{x) would give an energy density 

-^J^Al{x)Al{x) 



(394) 



associated with the electromagnetic potentials. This would give to the photon a finite rest-mass, and for 
this reason K is often referred to as the "self-energy of the photon" . There is a lot of discussion in the 
literature of this photon self-energy. But since physical arguments lead us definitely to give K the value 
zero, we have no choice but to say that the photon self-energy also is zero. This is of course the result that 
any consistent theory of electrodynamics must give. 

4) The logarithmic divergence i? is a real divergence and cannot be given the value from physical 
arguments. However, it gives only an induced charge exactly proportional to the inducing external charge. 
It is never possible experimentally to separate the external charge from the proportional induced charge. 
Therefore in all measurements of the external charge, the measured charge will be not jvE{x) but 



3ur{x) = 1 



2a 
3^' 



R juE{x) 



(395) 



juR here meaning "renormalized charge" . Thus the effect of the term R in (392) is only change the unit 
in which the external charge is measured. We write the results in terms of the observed external charge 
juR instead of the unobservablc juE, this change of units being called "charge renormalization" . Note 
the similarity between charge and mass renormalization. In both cases a divergent effect is recognized 
as producing no observable phenomenon, because it only changes the value of one of the fundamental 
constants, in the one case the electron mass m and in the other the imit of charge e. Since m and e are 
things which are observed directly, the divergent effects disappear completely when the results are written 
in terms of the observed m and e. Thus (392) becomes 



{3uh{x))„ = aj^R{x) 1^ ^ 



z dz 



:l0g 



1 + 



zq 
4/^2 



; dz 



(396) 



Everything here is now finite and observable. 

5) When A'l{x) is the potential of a pure radiation field without sources, jyR = and so there is no 
polarization. Thus for every photon or freely travelling wave, the vacuum behaves like a true vacuum; there 
arc no dielectric effects of any kind. This agrees with the common-sense idea of the vacuum. The result is 
however only true so long as the polarization can be treated as linear; if non-linear effects were included 
then two beams of light crossing the same region would produce a polarization current, giving rise to a 
"scattering of light by light" . The light-by-light scattering has been calculated, it is not zero but is much 
too small to be observable. 



6) The induced current (396) consists of two components, the first in phase with the potential A^{x) 
and the second out of phase by 7r/2. Speaking of the vacuum as a circuit being driven by the potential 
A%{x), the first is an inductive effect, the second a resistive one. So only the second term gives an absorption 
of energy by the vacuum from the driving potentials. 
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We study the energy balance, remembering that the classical potentials A'^{x) must always be real 
quantities like 

Al{x) = cos{q ■ x) (397) 
and we take without loss of generality qo positive. Then (396) gives, taking e • g = 0, 

{jvH{x))^ = {Acos.{q ■ x) +Bsm{q ■ x)} (398) 

where A and B are real and 

B = \acq^ / -j== (399) 

The energy supplied by the potentials to the vacuum per unit volume and per unit time is 

1 , , dAlix) 

V 

= —qo [A sin(g • x) cos{q ■ x) + B sin^ {q ■ x)] (400) 

Prom (400) one sees that the in-phase current gives no net absorption of energy, while the out-of-phase 
current gives a mean energy supply per unit time 

E = -^,qoe^B = (401) 

If q does not satisfy (390), i.e. if 

qo < VV + kP (402) 

then B = and E = Q, there is not enough energy in the vibrations of the field to create a real positron- 
electron pair whose rest-mass alone requires 2mc^, given that the field supplies in the interaction a momen- 
tum hk together with the energy hcqo- 

However if (390) is satisfied, then there is enough energy for creation of real pairs, each real pair carrying 
energy hcqo- Since q is time-like and e • g = 0, e is space-like and (e^) > 0. This can be seen as follows: 

e-q = = e- q- eoqo 

Sinc;e q is timclike, we can use a Lorentz transformation which makes q — 0; then, of course, 50 7^ 0. But 
then we must have cq = 0, which means that e is spacelike. Therefore E > 0, verifying that the potentials 
can never extract energy from the vacuum. And we have the probability per unit volume and per unit time 
that the potential (397) will create a real pair 



hcqo 8/1 



The result could of course have been found more easily by elementary methods. The thing I want to stress 

here is that the elementary real pair-creation processes are necessarily tied up with the less elementary 
vacuum-polarization effect given by the A term in (398), and the A term will exist whether or not real pair 
production is possible. The situation is quite the same as the tying up of the elementary line-width effect 
in atomic spectra with the less elementary line-shifts; we discussed these effects in detail earlier. Therefore 
we have quite as much reason to take seriously the vacuum-polarization effect given by the in-phase current 
in (396), as we have to take seriously the Lamb shift itself. Because physicists were unwilling to take these 
two effects seriously, physics was held up for quite a number of years. 
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Vacuum Polarization For Slowly Varying Weak Fields. The Uehling Effect. 

Let now the external potential A1{x) be not only weak but also slowly varying in both space and time, i.e. 
let it be a superposition of Fourier components (360) with 

« ^2 (404) 

Then by (390) the second term in (396) is zero^'^ and the logarithm may be expanded in terms of {q^ / ^'^). 
Keeping only the term of order 



But in each Fourier component ofj^R{x), operating with the D'Alembertian gives a factor (— g^). Hence 
the result, independent of the Fourier decomposition, valid for slowly varying fields 

{jM^))o = {D'jM^)} (405) 

Consider the effect of (405) in the case of the hydrogen atom. The proton^^ is represented by the static 
charge-density pp{r), and this induces a charge in the vacuum whose density is 

PiN(r) = -J^'^^PPi'^) (406) 

The electrostatic potential of the proton is thus V{r) + 1^n(?')) where 

VV(r) = -pp{r) 
VViN(r) = -piN(r) = ^^VVp(r) 

and hence 

Thus for a point proton, the potential added to the Coulomb potential by vacuum-polarization is 

Vm{r)=+-^6-'{r) (408) 

And the change in the energy of a state of the hydrogen atom with wave- function ■ip{r) is 

This is just the same as the formula for the Lamb shift, (284), with (—1/5) instead of the logarithm. Thus 
it is a factor 40 smaller than the Lamb shift, and in the opposite direction, —27 megacycles in the total of 
1062. Still the experiments are good enough to show that the effect is there all right. 

The result (409) was calculated many years ago by Uehling [17] using old-fashioned methods. 



IV. Field Theory Of Dirac And Maxwell Fields In Interaction 
The Complete Relativistic Quantum Electrodynamics 

Wc now take the combined system of Dirac and Maxwell fields in interaction, and make out of it a relativistic 
quantum theory using the methods we have already developed. This will then be the complete theory 
of quantum electrodynamics, applicable to all problems in which electrons, positrons and photons are 
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concerned. We also include in the theory a classical Maxwell field, which acts on the electrons and positrons 
and represents the effects of external charges such as protons which may happen to be present. 
Lagrangian 

^ = ^D+^M- ie^Jiip - ie^A""^ (410) 

Here we use Ai,{x) for the Maxwell potential operators and A1{x) for the potentials of the classical external 
field. 



Field equations 



^^^^al: + S(^^ + ^^^K^^V' = o (411) 



A 



i:{a|:-S(^^ + ^A)}^7.-M^ = (412) 

U^A^ = ieV^7^V (413) 

(See (384). )55 

These equations are non-linear. And so there is no possibility of finding the general commutation rules 
of the field operators in closed form. We cannot find any solutions of the field equations, except for the 
solutions which arc obtained as formal power series expansions in the coefficient e which multiplies the non- 
linear interaction terms. It is thus a basic limitation of the theory, that it is in its nature a perturbation 
theory stating from the non-interacting fields as an unperturbed system. Even to write down the general 
commutation laws of the fields, it is necessary to use a perturbation theory of this kind. 

Since the perturbation theory treatment is forced on us from the beginning, it is convenient not to set 
up the theory in the Heiscnberg representation but to use the interaction representation. The IR is just 
designed for a perturbation theory in which the radiation interaction is treated as small. In the IR the 
commutation rules can be obtained simply in closed form, and so the theory can be set up with a minimum 
of trouble. 

There arc two different interaction representations which we can use. The first may be called the Bound 
Interaction Representation. It is exactly the representation we used in discussing the radiation from an atom 
in the non-relativistic theory. We take all field operators to have the time-dependence of the Heiscnberg 
operators in the theory of the free Maxwell field and the electron field interacting with external potential, 
only the interaction between the two fields being omitted. Thus the field equations in the BIR are (332), 
(333), and 

U^A^ = (414) 
The wave- function $(f) in the BIR satisfies the Schrodinger equation 

ih^ = (415) 

Hnit) =ie j ^(r, i)4(r, t)i){r, t)d^r (416) 

This Hjj{t) is just the difference between the Hamiltonians of the theories with and without the radiation 
interaction. Because no derivatives of field operators occur in H^, the difference is just minus the difference 
between the corresponding Lagrangians, and so has the simple form given by (416). (Compare with (243).) 

Using the BIR wc can discuss the radiation of light by an atom, as it was done before, but now treating 
the atom relativistically. In fact this representation must be used, as soon as we wish to calculate effects 
accurately enough to require exact Dirac wave-functions for the unperturbed atomic states. However, the 
BIR is not convenient to use, because the commutation rules for the electron field are given by (346) and 
are still too complicated for all but the simplest problems. Therefore we shall use the BIR only when we 
are compelled to, and then usually only in the final stages of a problem. Generally we can do the main part 
of the work, with the major calculations, in the second type of interaction representation. 
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Free Interaction Representation 

Here we take all field operators ^, tp and to satisfy the free field equations (286) and (414). The 
commutation rules are then also given by the free- field formulae (203) and (298), (299). The wave- function 
satisfies the Schrodinger equation 



with Hjj formally again given by (416). But here Hji is not the same operator as in (415) because of the 
time-dependence of the ip and ip in the two cases. 

This FIR is the interaction representation normally used in quantum electrodynamics, and will be called 
henceforth the Interaction Representation or IR simply. It is very well siiitcd to rclativistic calculations 
because it makes the field-commutators and expectation values invariant functions. Thus the calculations 
can be explicitly and formally invariant, even when the potentials Af, are given in a special Lorentz frame 
as in the hydrogen atom. 

It was Schwinger and Feynman who first discovered the importance of making calculations formally 
invariant when using a relativistic theory. They made this discovery in completely different ways, charac- 
teristically. Feynman simply found that the calculations become much easier and simpler when done in a 
way which did not conceal the invariance of the theory. This is still true, in fact the main reason why we 
can tackle now more difficult problems than we could 10 years ago is just that the calculations with the new 
methods arc so much shorter. But the greater and essential advantage of the covariant calculations, pointed 
out by Schwinger, is that they enable the separation of finite observable effects from infinite renormalization 
terms to be made in a clear and unambiguous way. We had an example of this in the treatment we gave 
of vacuum polarization, where the covariant type of calculation was used. The divergent term (382) could 
be separated clearly from (379), because of the way in which (379) depends formally on the vectors and 

If the calculation had been done in a non-covariant way we could not have used (381) as we did. 

So now we shall apply the covariant methods, working in the IR, to solve a number of the standard 
problems of electrodynamics, in increasing order of difficulty. 



ih— = {H^{t) + HR{t)}^ 



(417) 




(418) 



Free Particle Scattering Problems 



In this extensive class of problems we are interested in calculating the over-all transition matrix element M 
between an initial state A and a final state -B, A and B being specified by assigning the spins and momenta 
of the free particles which are present in these states. The scattering process is supposed to occur as follows: 
The free particles which are specified by state A in the remote past, converge and interact, and other free 
particles emerge or are created in the interaction and finally constitute the state B in the remote future. 
We wish to calculate the matrix element M for this process, without studying the equations of motion or 
investigating the behavior of the system at intermediate times while the interaction is in progress. 

The unperturbed states A and B are supposed to be states of free particles without interaction and are 
therefore represented by constant state-vectors and $b in the interaction representation. The actual 
initial and final states in a scattering problem will consist of particles each having a self-field with which 
it continues to interact even in the remote future and past, hence <^a and <I>b do not accurately represent 
the initial and final states. However, so long as we are using perturbation theory and not including the 
higher order effects arising form the self-fields of the particles, it is consistent to \isc constant <I>^ and 
<^B representing bare particles without radiation interaction. Even when self-field effects are considered, it 
turns out that the bare-particle $a and $ b can still be used, although in this case some careful justification 
for it is needed. 

The matrix element M is 

M = ($^f/$A) (419) 

Here U^a is the state which is obtained a,tt = +00 by solving the equation of motion (417) with the initial 
condition <I> = ^a at t = —00. U can easily be written down as a perturbation expansion in the operators 

H"" and Hr, 

U = l+ (-l) r dh {H%h) + Hnih)} 

2 (420) 
J "^^^ J dt2{H%h) + HR{ti)}{H-{t2) + Hn{t2)} + ... 

--] - dh... dt^P{{H'{t,) + Hii{t,)}...{H^t„) + HR{t^)}} (421) 

Here the P signifies a chronological product, the factors in it being multiplied not in the order in which 
they are written but in the order of the times ti,t2, ■ ■ ■ ,tn, the factors with later times standing to the left 
of those with earlier times. This accounts for the factor 1/n! after we change all the limits to cover the 
whole range from —00 to +00. The operator U is generally referred to as "the S-matrix". 

Before discussing the general analysis of the series expansion (421), we shall use it to solve some standard 
problems. 

A. M0ller Scattering of Two Electrons 

We have in the initial state A two electrons in states {piUi){p2U2) and in the final state B two electrons 
(pWi)(P2^2)- The electron (piUi) is given by the one-particle wave-function 

uie'f^-^ (422) 



E 

n=0 
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normalized by (itiUi) = 1. With the wave function normahzed in this way, (422) is just the matrix element 
of the operator tjj{x) between the vacuum state and the state containing electron 1. We can see this from 

p,u 

where = V(5„„-. Then ($*, ^A(2;)$p„) = 6p„$p„)we^f^ = {^*,^o)ue'P'= = ue'P-^. 

Thus we consider the states 1, 2 and 1', 2' as if they were discrete states, the tp operator being given by 
the expansion (337). It would also be possible to use the continuous-state expansion (306) for -ip, but then 
the normalization of the initial and final states woidd have to be considered afresh. Since we fixed the 
normalization (472) when we previously derived the M0ller formula (144), we shall now stick to it. 

We shall calculate in the Born approximation as before, thus keeping only the term n = 2 in (421) which 
gives the matrix element M proportional to e^. In this problem the external potential A^ is zero. The term 
n = 2 in (421) is 

U2 = J I dXi dX2 P {^{xi)4ixi)'t(>{xi),i^ix2)4{x2Mx2)} (423) 

the integration going over all space-time. To obtain the matrix element M = ($^f/2^A) we only have to 
replace according to (377) 

i;{xi) = uie^P'-'^'bi + U2e'P^'''b2 + M'e*f'i-^-6'i + u'^e'P'^'^'b'^ by me^P^-'^^bi + uae^f^-^'^s (424) 

and 

i^{xi) = uie-'P'-'^'bl +U2e-'P'-'''b* +u[e-^P'^-'''bl' + u'^^^^ by u[e-'P'^-'''bl' + u'2e-'P'^-'''b;' 

because we only absorb 1 and 2 and only create 1' and 2'. Then we want to pick out the coefficient of 

{bTb,){b;'b2) (425) 

in the resulting expansion. There are no photons in the initial and final states and so the vacuum expectation 
value is taken for the Maxwell potential operators. We thus find, taking account of the fact that the b and 
b anticommute with each other as in the derivation of (358), the result 

= ITT / / '^^i {exp [i{pi - p[) ■ xi + i{p2 - p'2) ■ X2] (ui7AWi)(w27/i'"2) 

TX^" (426) 
- exp [i(pi - • a;i + i{p2 - p[) ■ X2] {u'2'y\Ui){u['yi^U2)} {P{Ax{xi), A^(a;2)})„ 

The expectation-value of the chronological product brings in an important new function, 

{P{Ax{xi),A^{x2)})^ = ^hc6x^DFixi-X2) (427) 
where F stands for Feynman. Also called by Stueckelberg D^, c for causality. [18] 

Properties of the Dp Function 
Since 

P{Ax{xi),Af,ix2)} = l{Ax{xi),A^{x2)} + ^e{xi-X2)[Ax{xi),A^{x2)] (428) 
we have by (203) and (205) 

■ 1 



Df{x) = D^^\x) + ie{x)D{x) = ^ 



„ - inSix ) 
x-' 



= D^'\x)-i{DA{x)+DR{x)} 

= D^'^\x) - 2iD{x) (429) 
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according to (228). Obviously Dp is an even function. It has the property that asymptotically as xo ^ c» 
in the future, Dp = 2iD^ contains only positive frequencies, while as xa — oo in the past. Dp = —2iD~ 
contains only negative frequencies. See Fierz, Helv. Phys. Acta 23 (1950) 731 for a full discussion. 

Thus Dp is the potential arising from a point source disturbance at the origin, when all the potential 
travelling out in the future represents particles created, and all the potential travelling in from^^ the past 
represents particles to be absorbed, all particles having positive energy. It is thus the potential which 
maintains the correct causal time-sequence of events, and in this way it was discovered by Stueckelberg. 
But the definition (427) is easier to understand and use. 

The momentum representation of Dp is 



Dp{x) = 



-2i 



(430) 



The contour of integration here is along the real axis, below the pole at fco = ^l^^l and above the pole at 
fco = in the fco plane: 



We can see this using (429), (207), (208), and comparing (210), (226) and (209) one also knows that 
D^^\x) = I exp(iA; ■ x) with ( — ¥r 



(27r)4 




(430) is always known as a "Feynman integral" . We can also write 



Dp{x) 



{2^Y 



Ak-x 



(431) 



Here the integration is along the real axis for all four components of fc. e is a small positive real number 
and the limit e is supposed to be taken in (431) after the integration is done. Before going to the limit, 
the e term just shifts the poles off the real axis, the pole |A;| going done and the pole — |fc| going up, and so 
the integral is well-defined and non-singular. 



The M0ller Formula, conclusion 

Using (427) and (431) in (426), the integrations over xi and X2 can be done at once, giving a (5-function 
involving fc, and the fc integration can then also be done immediately. The result is 

- E -P'.) (Ml^MfeM „ (^27A^i)(g7A^.) j (432) 

^ nc [pi p^) le (pi P2) J 

Now pi and p'^ are both electron momentum 4- vectors and hence (pi — p[ ) is a space-like vector and its 
square cannot be zero. Hence we can go directly to the limit by putting e = in (432). This gives us the 
M0ller formula (144) allowing for a difference in units in p and e. 

It is clear that the formula comes very directly out of (423), once we know the momentum expansion 
(431) of the Dp function. And we shall find that things are equally simple in other free-particle scattering 
problems. 
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Electron- Positron Scattering 

Exactly the same formula (432) also gives the matrix element of scattering of an electron by a positron. 
We must assume that the electron is initially in state 1 given by 

uie'f^-^ (433) 

and finally in state 1' by 

u'le^f'i-^ (434) 
But now the initial state of the positron is given by the wave-function 

tl'se-^f^-^ (435) 

and the final state by 

tlse-'^^-^ (436) 

using the negative-energy electron wave- functions and not the charge-conjugate functions to represent the 
positron. The correctness of (435) and (436) is clear since 62 is the emission operator and b'2 the absorption 
operator for this positron. 

The second term in (432) now represents not an ordinary exchange effect but a special short-range 
scattering dnc to a virtual annihilation of the positron and electron. This term has been observed experi- 
mentally by measuring the fine-structure constant of positronium. (M. Deutsch and E. Dulit, Phys. Rev. 
84 (1951) 601, (Nov. 1, 1951).) 

B. Scattering of a Photon by an Electron. The Compton Effect. Klein-Nishina Formula. 

Again we use the same operator U2 given by (423). We only have to calculate its matrix clement M2 
between an initial state A and a final state B, where A consists of an electron with wave- function 



ue'P'' (437) 

= e^e^'^- (438) 
u'e'f'-^ (439) 

Af, = e'^e'"'-^ (440) 

The operator A\{xi) appearing in (423) contains both photon emission and absorption components 
according to (211), and so does >l^(a;2). Hence the matrix element M2 is a sum of contributions; we can 
either take e\e'^'''^^ from Ax{xi) and^^ e^e*'^ '^^ from Afi{x2) or vice versa. Likewise the electron can be 
absorbed by tp{x2) or emitted again by tp{xi), or vice versa. Thus altogether we find for M2, after taking 
into account that the whole expression is symmetrical in xi and X2, 



and a photon with potentials given by 
and B consists of the electron in state 
and the photon with potentials 



= X! f f dXidX2 ^e^{ip- X2-ip' ■ Xi){u[jx)^{e{xi - X2)P {lpa{xi),ll}/3{X2)})^ijl^u)^'^ X 

{exe'ij^ exp(ifc • xi — ik' ■ X2) + e^ue^ exp(zA; • X2 — ik' ■ Xi)} 

(441) 

Note that with anticommuting fields the expression e{xi — X2)P {^a{xi), ■(/'^(a;2)} is a relativistic invariant 
whereas the P-product by itself is not. Thus in analogy with (427) we write 

{e{xi - X2)P {V'a(a;i), V'/3(a;2)})^ = -^Spapixi - X2) (442) 
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where Sp is a new invariant function. Since 

e(X2 - Xi)P {tpa{Xl),tpi3{X2)} = ^[tpa{xi) , tp fjixa)] + ^e{x2 - Xi) {tlJa{xi),tlj,^{X2)} (443) 
we have by (299) and (329) 



Sf{x) = S^^'' +ie{x)S{x) 



in exact analogy to (429). We write also 



Sf{x)=Y1 



and we find from (444) the momentum representations 

Af{x) = 



(27r)^ 



ik-x 



Sf{x) = 

We can also write (447) conveniently as 



fc^ + — ie 
^ + in 



d'^k 



^ (27r)4y^ l^-z/x 



(444) 
(445) 

(446) 
(447) 

(448) 



Here the Dirac matrix in the denominator means that we have to multiply above and below by + ifi) in 
order to evaluate the integral. Thus (448) is not a real simplification of (447), only it saves writing. The 
Feynman integration in (448) is defined as a contour integral exactly as in (430). 
Now substituting from (448) into (441), just Hke for (432) 



— e 



M2 = ■^{2Trf6^ip + k-p'- k')u' 



1)1 



f + ^ — i/j. 



(449) 



Again the denominators {p — kY + l-t^ can never vanish so the e can be put equal to zero in (447). In fact 
if the electron is initially at rest 



(p - k'f + /x^ = (p2 +ij^)+k'^ -2p-k' = 2pok'o = 2iik'o 

and similarly 

(p + kf + At^ = -2iiko 
because fc'^ = and p^ = —fi^; p = because the electron is at rest. 



Mo 



2?l2cV 



{2T:)^S^{p + k-p' -k')u' 



ko 



(450) 



(451) 



Now we can simplify (451) further. Since the photon is not polarized in time, 64 = 64 = 0; since the electron 
is at rest, p = 0. Hence e ■ p = 0, and thus f^p = —0 + 2Ie ■ p = —0, i.e. ^ and ^ anticommute. This plus 
the fact that m is a spin state of momentum hp, i.e. 



{p — ijiju = 

mean that the term |^ + i/x in (451) can be omitted. Thus we get 

^2 



M2 = 



2?l2cV 



{2n)^6^{p + k-p' -k')u' 



ko^ 



(452) 



(453) 
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Calculation of the Cross-Section 
We write as in (145) 

M2=K{2Tr)^S'^{p + k-p' -k') (454) 
Then the scattering probabiUty per unit volume and per unit time for the single final state is as before 

W5 = c\K\^{2Tr)'^S'^{p + k-p' -k') (455) 

The number of final states for the electron is 



1 



mc 

1y 



dp\ dp'2 dp'^ 



(456) 



The photon with potentials (440), when e'^ is a space- like vector having (e^)^ = 1 is normalized to one 
particle per volume hc/2k'Q. This can be seen at once, comparing (440) with (211) and (214), and taking 
into account the difference of {2tt)^ between the continuous and discrete normalizations. Hence the number 
of final states for the photon is 

1 



(27r)3 \2k'o 



) dk[ dk'2 dk's 



The total transition probability is thus 



w = c\K?- 



mc^ he \ dk[ dk'r, dk'r. 



(27r)2 V E' 2k'oJ d{p'o + k'o) 



(457) 



(458) 



Wc write this as a probability for scattering the photon with frequency k^ into a unit solid angle dCl. Then 
using momentum conservation, wc have 



dp'o _ p[ dp[ 
dk'r, 



p[ dk[ 



p[k[ + ---+p'3kii 



P'odk'o 



dk'o 



(459) 



Hence finally 



d{p'o + K) -{P' ■ -P ■ ^ Poko 

he 



w = c\K\'^- 



1 



-k'^dn 



(27r)2 2ko ° 

The differential cross-section for scattering the photon of frequency ko into solid angle dCl is then 

WV1V2 



(460) 



where Vi is according to (149) the electron normalization volume 

V2 = ^. Thus 
2ko 



E 



a 



47rfco 



K 



2ft2c2 



K\^k'(fdn 

1' : J' ^ 

^ ITS 



1 and V2 is the photon volume, 

(461) 
(462) 



This gives the cross-section for a known electron spin in the initial and final states. 
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Sum over Spins 

Experimentally we do not observe the electron spins. Hence we observe only the cross-section a obtained 
by averaging a over the two spin states u and summing over the two spin states u'. The summing and 
averaging wc do by the method of projection operators according to (109), (114). 
Here are some rules for spurs and dagger operators: 

^ is a Dirac matrix in general. 

1) Spur (^(1)^(2) . . . ^(2fe-i)) = 0, i.e.the spur of an odd number of factors is 0. 

2) Spur (^(1)^(2) . . . ^(2fc)) = Spur {P . . . ^(2^))} where P is any cyclic permutation. 
This is clear, because any cyclic permutation consists of steps of the form 

and for two square matrices^^ A, B 

Spur^i? = ^^aijhji = Spur 5^4 

3) Spur (^(1)^(2) . . . ^(2fe-i)^(2fc)) = Spur (^(2fc)^(2fc-i) _ . _ ^(2)^(1)^ 

To show this, it is enough to assume that all arc different; by the commutation rules of the 7's wc can 
always reduce the product to this form. Then, since each inversion (of neighbors) brings in a minus sign, 
and since there are an even number of inversions, we get our rule immediately. 

4) = -^ + 2l{a-h) 
In particular 

= I (e • e) = -t-4 = -I^V 

5) By 2), one can cyclically permute a product of dagger operators without changing its spur. 

Now we can proceed to evaluate the sum over spins. We have 



u u' u u 

o4 



e 



til, lit J I / j/ I ■ \ { it ii , lit 



Spur {f + z„) ^j-^' + ^'^^ (/ + z„) + ^'^^ ) (463) 



t -o , ^171 + ^272 + ^373 .r, , 

Now — = z/J H ; = ip + ^k , say. 

ko ko 



Similarly — = i/J + 7^,/. 

Kq 



Since anticommutes with ^, fJ , 7^ and jk', (compare with remark after (451)) wc may write (463) in the 
form 
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because, using 4), and again compare with remark after (451) 



59 



and similarly^*^ 



4' 



k' ' 



SO adding the terms gives 



Now since = 



^2 — (for the photons because they are on the hghtcone, for the electron 



because p = — ), we get for \K\ 

„4 



«0 '*^0 

because 

(*M - 1*) (/ + «m) = «/-'(/ -~f)-y^- fi = - 1^') - 

We consider the second part of (464) first: 



(464) 



^' + ^) = (i^-|>)(^-|i') 



Spur 



^y'iy'{-0-t')-{t-$' 



= 2po{ka - fco) Spur 



= 2^{ko — kg) Spur 



-2(e-e') ( 



-8m ^" (A; • fc') + 4^(6 • e') Spur 



smce 



Sp = Sp h^'^^'it'^] = Sp [-^^^V'^'^ + 2l0t'l:{e ■ e')] = Sp [-|^'^ - 2(e • e 

Hence altogether (464) becomes 



l\lL' ll 



-8/x ^° (fc • k') + Me ■ e') Spur 



A-' ' 



32/1404^4 

But l^^l^ = = and 

ip'-p)^ = {k- k'f = fc2 + -2k-k' = -2k ■ k' 

[P - P? = P'"^ +p^ -2p' ■p = 2ii^ + 2iip'o = 2ii^ + 2ii{-ii + ko- k'o) = 2/z(A;o - k'o) 
Hence k ■ k' = — /i(fco — k'^) and then 



Sp 



= Sp 



fco 

= Sp 



= Sp 



fco fco 
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and so (464) is 



Hence by (461) the cross-section averaged over electron spins is 



(465) 



The classical electron radius is 

rn = 



Hence 

'=W^i^(|)'{^^+4c„V} (466) 

where (f) is the angle between the polarizations of the incident quantum fco and the emitted quantum feg- 

This is the famous Klein-Nishina formula. 

To put a explicitly as a function of the scattering angle 9, we must use the equations^^ 

k ■ k' = — ^(fco — fcg) 

k ■ k' = cos6' — k^kg — fcofco(cos0 — 1) 

A;oA;o(l — cos0) = ii{ko — k'^) 



Put 



Then 



— = 1 + (1-COS0)- 



ko f Photon energy 



(1- cos 0)2^2 \ 

— ^+4cos2<^ 

l-|-e(l — cosb*) / 



^ = yo ' 7^^^ 2 — - (467) 

[1 + e(l — cos6')J 

Thus for large e the scattered photons are mainly unpolarized and concentrated in the forward direction. 
For small e (non-relativistic problem) we have simply^^ 

CT = r^cosVdO (468) 

the classical result. Summing over the two polarizations of the photon k' and averaging over all polarizations 
of k, this gives the cross-section for all polarizations 

W = ^rl{l + cos'^e)dn (469) 

We get this by evaluating ^ X]eSe'(^ ' ^')^- First we have to sum over the two polarization directions of 
photon k'. This summation for three directions would give 

Hence for the two directions perpendicular to fc' we can write 

^(e . e'f = 1 - (e . kf 
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Now we perform the other summation over the two polarization directions of photon k, using the same 
argument. This gives 



-12 -To' 

k '-^2 



(fe • fe )^ 



= 1 + cos^ 



This with the averaging factor of 5 gives (469). 
The total cross-section then is 



(470) 



This non-relativistic scattering given by (468) - (470) is called Thomson scattering. 



63 



C. Two Quantum Pair Annihilation 

Consider a process by which an electron in the state {p, u) and a positron associated with the wavc-fmiction 
(439) are both annihilated, with the emission of two photons given by the potentials (438) and (440). The 
positron momentum-energy 4-vector is then {—hp') so we write p+ — —p'. The positron spinor in the 
charge-conjugate representation is v = Cu'^ . 

This annihilation process will again be effected by the operator U2 given by (423). And the matrix 
element for the transition is exactly as before given by an expression identical with (449) except that k is 
now replaced by — fc, namely 

M2 = -^^^§^5\p + p+-k-k')u'\f^ ^_^' + ^' 



= K{2'KY5'^{p + p+-k-k') 

We consider the probability for this process for an electron and positron both at rest. The result will 
then apply to the decay of a positronium atom, where the velocities are only of the order of ac and can be 
treated as zero to a good approximation. 
Then 

p = p+ = (0,0,0, i/i) 



As in (453) we have 



K = ,^f^u\0i' + i'U)u (473) 

The decay probability per unit volume per unit time into a solid angle for one of the photons is 

2 1 / hc\'^ dkidk2dk3 
'^ = ''1 ' (2^ d{ko + k',) 

(because ko = ji here) in analogy to (458). But now d{ka + k'^) = 2dkQ and so 

w = c\K\''y^-h^c^d^ (474) 
' ' (27r)2 8 ^ ' 

For parallel polarizations, e = e', and 

{0^' + ^'1^^) = + $)^ -2ipf3 

But P has zero matrix element between the spin-states u and u' , which are positive and negative- frequency 
states both of zero momentum. Hence for parallel polarizations 



w = 



(475) 
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For perpendicular polarizations, take coordinate axes 1 along e, 2 along e', and 3 along k. Then 

{0'^' + f!$f}j = /x{7i(-73 + i/3)72 + 72(73 + «/3)7i} = 2/U717273 
Hence for perpendicular polarizations^'* 

'0 -1 0" 

10 
1 
0-10 



2ij,v^ 



J when spins u and v are parallel 
1 2/i\/2 when spins u and v are antiparallel 

We get this latter result by observing that for antiparallel spins the initial wave function is 

, x/2Vv'0> 

(neglecting the "small components" ) and therefore 



^ = 7!Gmi 



-1 

1 



-1 

1 



-1 

1 



(476) 



1 

72 



(10) 



(0-1) 



x/2 



V2 



This is one place where charge-conjugate spinors are useful and necessary! 

Summarizing, we find for electron and positron with spins parallel, in triplet state, the 2-photon decay 
is forbidden. This selection rule is in fact an exact one for positronium in the ground state Is triplet. Only 
3-photon decay can occur and this makes the lifetime ~ 1100 times longer. For electron and positron in 
singlet state, the 2-photon decay occurs with the photons always polarized perpendicularly to each other. 
The probability for the decay, integrating (474) over a solid angle 27r since the photons are indistinguishable, 
is 

-^W^p|;5;;^4„.J (477) 

Formula (477) is for electron and positron normalized to one particle per unit volume. If the density of the 
electron probability relative to the position of the positron is p, then the mean annihilation life-time will 
be^^ (the "classical electron radius" to = e^/(47rmc^) in Heaviside units) 

^ (478) 



Aircr^p 



For the positronium Is singlet state 



8na^ 



Bohr radius 

n 



I37V0 



T = 2 X 137* X — = 2 X 137^ x — ; 

c m,c 



1.2 X 10"^° sec. 



(479) 



For slowly- moving electrons and positrons with relative velocity v, the annihilation cross-section accord- 
ing to (477) will be 

Anrl (^^^ singlet state, (480) 

proportional to - just like neutron cross-sections at low (thermal) energies. 
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D. Bremsstrahlung and Pair Creation in the Coulomb Field of an Atom 

We consider these two important processes together. Given an external potential A* representing the 
Coulomb field, the processes are: 

Bremsstrahlung 

Electron (pu) — *■ Electron {p'u') + Photon (fc'e') 

Pair-creation 

Photon (fc'e') Electron (pu) + Positron (p'^u') 

We treat not only the photon {ke) but also the potential A^ in Born approximation. This is valid so long 
as 

Potential energy x time of transit <C h 

Ze^ r 
or - — X - <C ?i 
47rr V 

The treatment will only be good for relativistic velocities ~ c, and for lighter atoms Z <C 137. In fact for 
heavy atoms {Z = 82 for lead) and v c the error from the Born approximation is about 10%. 

The processes arise in the Born approximation just from the term linear in and linear in A^ in (421). 
This term is 



Ui = ^ JJdxi dx2 P {i;{xi)4{xiMxi), i'{x2)4\x2)i'{x2)} (482) 



The factor i in (423) is now missing, otherwise everything is the same as before. We suppose Af^{x2) is a 
superposition of Fourier components 



where f{k) is a known function of k. For a static Coulomb field all the vectors k appearing in (483) have 
zero 4*"^ component, and is the constant vector (0,0,0, i). We calculate the matrix element Mi for 
bremsstrahlung or pair creation with A'j^ given by the pure Fourier component (438) ; the results are then 
to be afterwards superposed to give the actual potential according to (483). 

For bremsstrahlung the formula for Mi is (449), identically the same as for the Compton effect, or 
integrating over the frequency k 

n c y p — ^ — ijji^ 

The factor 2 difference between (482) and (423) is just compensated by the fact that the photon k'e' can be 

emitted by two operators in (423) and by only one in (482). The bremsstrahlung cross section is then 
calculated by squaring (484) and integrating over k' and p' with appropriate normalization factors. For the 
details see Heitler's book §17.^*^ 

For pair creation the same formula (449) gives the matrix clement Mi, allowing for the fact that the 
roles of the particles are now changed around so that electron {pu) instead of {p'u') is created etc. Thus 
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and integrating over the components of the potential 



^1 = fiP + P+ + ^ ^ ,! . i \ u' (486) 



For the cross-section calculation see again Heitler, §20.^'' 



General Theory of Free Particle Scattering 



Wc have shown how (421) leads to matrix elements for standard scattering processes, from which cross- 
sections can be calculated. In each case we used only the term n = 2 in (421), which happened to be 
the lowest term giving a contribution to these processes. The higher terms n = 4, 6, . . . will also give 
contributions to the matrix elements for these processes, contributions which arc collectively called "radia- 
tive corrections" . It turns out that the results without radiative corrections agree with the experimental 
scattering cross-sections in all cases. The experiments are never accurate to better than a few percent, 
and the radiative corrections are always smaller than the lowest-order terms by at least one power of 
{e^ /A-Khc) = (1/137). Thus the study of the radiative corrections for scattering processes will not lead to 
any directly observable effects. 

Nevertheless wc shall work out a method of calculating the high-order terms of (421). This method turns 
out to be simplest and easiest to explain when we are discussing scattering problems. Incidentally we shall 
see what the radiative corrections to scattering look like, and we shall learn something about the nature 
of radiative corrections in general. Finally at the end we shall be able to use the method of calculation in 
order to find the radiative corrections to the motion of an electron in a hydrogen atom, which is the case in 
which these small effects can be accurately observed, but where the pure scattering theory is not directly 
applicable. 

To avoid unnecessary complications we suppose there is no external field A^. Problems in which there 
is an external field, so long as it can be treated in the Born approximation, can always be simply related to 

problems without external field, just as the brcmsstrahlung matrix clement (484) is related to the Compton 
effect (449). When there is no external field the matrix clement for any scattering process is 



The operators in (488) are field-operators of the interaction representation, the integrations over the points 
xi,. . .Xn extend over all space-time, and the initial and final states A and B are entirely arbitrary. 

We wish to calculate the matrix element M of 5 for a particular scattering process, in which the states A 
and B arc specified by enumerating the particles present in the two states. We must now take properly into 
account the fact that the particles in states A and B, although well separated and no longer interacting with 
one another, are real particles interacting with their self-fields and with the vacuum- fluctuations of the fields 
in their neighborhood. Thus A and B are really time-dependent states in the interaction representation 
and will not be given by time- independent vectors $a and except in the lowest-order approximation. 
(See page 81.) Let ^B{t) be the actual time-dependent state- vector of the state B in the IR. We are not 
interested in the dependence of $B(t) on t. In an actual scattering experiment the particles in state B are 
observed in counters or photographic plates or cloud-chambers and the time of their arrival is not measured 
precisely. Therefore it is convenient to use for B not the state- function ^B{t) but a state- function $b 
which is hy definition the state-function describing a set of bare particles without radiation interaction, 
the bare particles having the same momenta and spins as the real particles in the state B. In the IR the 
state- function $b is time-independent. The question is only, what is the connection between ^ B{t) and 



(487) 




(488) 
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Suppose ts to be a time so long in the future after the scattering process is over, that from Ib to +00 
the state B consists of separated outward- travehng particles. Then the relation between ^B{t) and $b is 
simple. We imagine a fictitious world in which the charge e occurring in the radiation interaction decreases 
infinitely slowly (adiabatically) from its actual value at time ts to zero at time (+00). In the fictitious 
world, the state ^Bits) at time ts will grow into the bare-particle state $b at time -|-oo. Thus 

$s = i^2(is)*s(is) (489) 

where^^ 

■^/e\"l/'°°/"°° — — 

MtB) = 2Z[-r) —\ ■■■ dXi...dXnP{^M{xi),...,^M{Xn)]gB{tl)...gB{tn) (490) 

„=0 JtB 

and gsit) is a function decreasing adiabatically from the value 1 at t = ts to zero at t = oo. Similarly, 
when tA is a time so far in the past that the state A consists of separated converging particles from t = —oo 
to t = tA we have 

^AitA) = ill{tA)^A (491) 
MtA) = 22[r) ~\ dXi... dXnP{^4Hxi),---,^MiXn)}gA{tl)...gA{tn) (492) 

where gA{t) is a function increasing adiabatically from t = -co to t = Ia- 
The scattering matrix element between states A and B is given exactly by 

M={^l{tB)Sll^A{tA)) (493) 
^*f=E(^)-|/ ••• / P{'4'Mxi),...,'<PMxn)] dx^...dxn (494) 

Of course (493) is independent of the times Ia and ts- When Ia and ts are chosen so far in the past that 
(489) and (491) are satisfied, then (493) may be written in the form (487), where now 

5 = n2(ts)5*fOi(iA) 

/ e \ " 1 f°° (495) 

= J2[t) 1 dXi...dXnP{ll)4Hxi),---,1p4^{Xn)}g{tl)-..g{tn) 

^iicy n. J— oo 

and g{t) is a function increasing adiabatically from to 1 for —oo<t<tA, equal to 1 for tA < t < ts, 
and decreasing adiabatically from 1 to for Ia < t < oo. Thus we come to the important conclusion that 
formula (487) for the matrix element is correct, using the bare particle state-functions ^a and <1>b, provided 
that formula (488) for S is interpreted by putting in the slowly varying cut-off function g{ti) to make the 
integrals converge at U = ±oo. The cut-off functions are to be put in as they appear in (495), and then S 
is defined as the limit to which (495) tends as the rate of variation of g(t) is made infinitely slow. 

The main practical effect of this limiting process in the definition of 5* is to justify us in throwing away 
all terms in the integrals which oscillate finitely at U — ±oo. There are however certain cases in which the 
integral (488) is in a more serious way ambiguous due to bad convergence at ti — ±oo. In these cases the 
cut-off functions have to be kept explicitly until a late stage of the calculations before going to the limit 
g{t) = 1. In all cases, if the limiting process is done in this way, the matrix element M is obtained correctly 
and unambiguously. 

The use of bare-particle wave-functions ^a and $b in (487) is thus justified. This makes the calculation 
of M in principle simple. It is only necessary to pick out from (488) the terms which contain the right 
combination of elementary emission and absorption operators to annihilate the particles in A and to create 
those in B. We shall next describe a general method of systematically picking out those terms, which is 
due originally to Feynman. It was first explained in published form by G. CWick, Phys. Rev. 80 (1950) 
268. Feynman and Wick have applied the method only to chronologically ordered products such as appear 
in (488). However the method applies in the same way to all products whether chronological or not, and 
we shall describe it in full generality. 
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The Reduction of an Operator to Normal Form 

Given any operator O which is a product of field operators, for example 

O = i^{xi)4{xiMxi)i^{x2)4{x2Mx2) (496) 

wc want to pick out the matrix clement of O for a transition between states A and B in which there is a 
known distribution of bare particles. For example A may be a state with only one electron in state 1, and 
B a state with one electron in state 2. Then we wish to pick out from (496) terms in which there appear 
the operators bi and In order to pick out all such terms systematically, we make a complete analysis of 
O into a sum of terms On, each On being a sum of products of emission and absorption operators in which 
all emission operators stand to the left of all absorption operators. Any operator in which the emission 
and absorption operators are arranged in this special way is called "Normal"™ [19]. The On will be called 
the "Normal constituents" of O. Once O has been analyzed in this way, then we find the matrix element 
simply by taking the coefficient of 6261 in the expansion J2^n- No other term in the expansion can give 
any contribution to the matrix element. In O itself there might appear a term such as 

6^636361 (497) 

which would give a contribution to the matrix element, since the operator 63 could create a particle in 
an intermediate state 3 which the operator 63 would then annihilate. The expansion of O into normal 
constituents eliminates all terms such as (497) and replaces them by sums of normal products with numerical 
coefficients. Thus using the anticommutation rule for 63 and 63, (497) becomes replaced by 

Ablh - b2blb3bi (498) 

where A is a numerical coefficient. The second term in (498) gives no contribution to the matrix element. 

It is clear that by using the; c;onimutation rules of the operators in this way, every O can be written as 
a sum of normal products, and that the analysis leads to a unique expansion of O. But we do not need 
to go through the tedious algebra of using the commutation rules, instead we can write down the normal 
constituents On directly by following simple rules. 

First, we define the notation N(Q), where Q is any product of emission and absorption operators, 
to be the product obtained by simply rearranging the factors of Q in a normal order, irrespective of the 
commutation rules, with a factor (—1) if the rearrangement involves an odd permutation of the electron- 
positron operators. Similarly if Q is any sum of products, N{Q) is defined by rearranging factors in each 
term of the sum in the same way. Thus we have for example (see (211)) 

N{Ax{x)A^iy)) = A+{x)A;{y) + A-^{x)A;iy) + A-^ix)A+Ay) + A;{y)A+{x) (499) 

where (x) is the positive-frequency part of Af^ (x) , i.e. the part containing absorption operators. Observe 
that the order of factors in the first two products in (499) is immaterial, only the third and fourth products 
have their order fixed by the condition of being normal. Similarly (see (306) and (309)) 

N{Mx)i;^iy)) = i^+{x)^^{y) + ^l^-{x)i^0{y) + " (ar) (y) - l^p{y)^l^ + {x) (500) 

With this notation, every product of two field operators can immediately be written as a sum of normal 
constituents. Using the commutation rules (213) and the vacuum expectation values given by (219), (220), 

Ax{x)A^{y) = {Ax{x)A^{y))^ + N{Ax{x)A^{y)) (501) 

Similarly, using (310), (311), (324), 

And in fact for any two field operators P, Q we have 



(503) 
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provided V and Q are both linear in emission and absorption operators. The proof of (503) has in effect 
been done by proving (501) and (502), because these include all of the possible products of two boson or two 
fermion operators, and (503) is trivial for the product of one boson and one fcrniion operator, because they 
commute. Equations (501) - (503) are operator identities, and hold whether or not the physical problem 
is directly concerned with the vacuum state of the fields. In fact we could if we wished define the "vacuum 
expectation values" as the functions appearing in (501) - (503) and so avoid speaking about the vacuum 
state at all. 

Next we shall state the generalization of the rule (503) to any product O of field operators, for example 
the O given by (496). We define a "factor pairing" of O by picking out from O a certain even number 
of factors, either all or none or any intermediate number, and associating them together in pairs. For the 
product VQ there are only two factor-pairings, either we choose the pair VQ together or we choose no pairs 
at all. To each factor-pairing n corresponds to a normal constituent On obtained as follows: For each pair 
of factors VQ which is paired in n, On contains the numerical factor {VQ)^, the order of V and Q being 
maintained as it was in O. The unpaired factors TZiTZ^ ■ ■ ■ TZm in O appear in On rearranged in normal 
form. Thus the complete form of On is 

On = ± {VQ), {VQ'), . . . N{1li,n2, ...Tin,) (504) 

the sign in front being + or — according to the even or odd character of the permutation of the electron- 
positron operators from the order in which they arc written in O to the order in which they are written in 
(504). With this definition of (504) of ©„, we have the following theorem: 

Every operator product O is identically equal to the sum of the On obtained from all its factor-pairings. 

This theorem gives the decomposition of O into its normal constituents. Equations (501) - (503) are just 
special cases of it. Clearly non-zero 0„ are only obtained when each pair of factors is either a tjj and a ip 
operator or two A^ operators. We shall therefore suppose that the factor-pairings are always restricted in 
this way. 

The proof of the theorem is very simple, by induction on m, the number of factors in O. The theorem 
is true when m = 1 or 2, so we need only prove it true for m assuming it true for m — 2. Let then O' be a 
product of (m — 2) factors. First we show that the theorem is true for 

= {VQ± QV)0' (505) 

where V and Q are field operators and the plus sign appears only if V and Q are both electron-positron 
operators. In fact, the normal constituents ofVQO' and of {±QVO') will bo identical, so long as V and Q 
are not paired together. Therefore the sum of the normal constituents of O reduces to 

J20n = {{VQ)o ± {QV)J J2 O'n (506) 

But J2^'n = ^\ and 

{VQ), ± {QV), = (VQ ± QV) (507) 

this being a number and not an operator. Therefore (506) gives ^ 0„ = 0, and the theorem is proved for 
O given by (505). Next let O be any product of m factors. Then by using the commutation relations we 
can write 

O = N{0) + S (508) 

where S is a sum of terms of the form (505). The theorem is true for each term (505) and so is true for S. 
The theorem is trivially true for N{0), because {VQ), = for every pair of factors V, Q in the order in 
which they occur in N{0), and so all normal constituents (504) of N{0) are zero except the constituent 
N{0) itself. Therefore the theorem holds for every O given by (508), and this completes the proof. 
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Feynman Graphs 

We use a method of Feynman to enumerate the possible factor-pairings of O. Each pairing is pictured in 
a diagram or graph G. G consists of a certain number of vertices with hues joining them. The vertices 
represent simply the different field-points at which the factors of O operate. Thus for O given by (496) 
each G has the two vertices xi, X2- The lines in G are either dotted, representing photon operators, or 
undotted, representing electron-positron operators. The rules for drawing these lines are the following: 

1) For each factor-pair 'ip(x)ip{y), an undotted line is drawn in G running from x to y, the direction 
being marked by an arrow in the line. 

2) For each unpaired factor tlj{x), an undotted line is drawn running from x out of the diagram, the 
other end of the line being free and not being a vertex of G. 

3) For each unpaired factor ipiy), an undotted line is drawn running into y, the other end of the line 
being free. 

4) For each factor-pair A^{x)Ai,{y), a dotted line joins x and y. 

5) For unpaired factor Afj_{x), a dotted line is drawn with one end of x and the other end free. 

6) Every undotted line has a definite direction marked by an arrow. A dotted line has no direction 
and no arrow. 

In general, we must allow factor-pairings in which two operators at the same field-point are paired 
together. This will give a line in G with both ends at the same point. However in the case of operators 
such as (496) or more generally (488), a pair of factors taken from the same point will always give rise to a 
factor 

{jt^i^))o = -iec{^ixh^^ix))^ (509) 

in the corresponding normal constituents (504). We saw in the discussion following (366) that the vacuum 
expectation value (509) is zero, the operators being IR operators. Therefore factor-pairings in which two 
factors at the same field-point are paired, in the analysis of quantum-electrodynamical operators such as 
(488), always give zero contributions. So we may add to our list of rules for the construction of G: 

7) Lines joining a point to itself are forbidden. 

The possible factor pairings of (496) are then represented by the following G's: 




G2 G4 
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G5 G5 Gy Gg 

Corresponding to these G's there will be just 8 normal constituents of O, which we call Oi . . .Os- These 
normal constituents are, in their respective order: 

Oi = ±N{i;{xi)i(xi)^j{xi)'^{x2)4{x2mX2)} 
02=± {TlKxiWx2))^ N{4.{X1WXI)TI,{X2U{X2)} 

03 - ± {7KX2)MX1))^ N{^{xi)4ixi)4{x2Mx2)} 

04 = ± {4{ximX2))^ N{'^{xi)i:{xi)'^{x2Mx2)} 

05 = ± {7j{xiMx2))^ {4{xM{x2))^ N{i;{x2Mxi)} 

Og^± (^{x2)Mxi))^ {4ixi)4{x2))^ N{^{xi)ij{x2)} 
07 = ± {7j{xi)i;{x2))^ (y^{x2mxi))^ N{4{xi)4{x2)} 

Os = ± (?(X1)^(X2))„ (^ix2Mxi))^ {4{xi)4ix2))^ 

This type of process to which Oi gives matrix elements can be seen at once by looking at the external lines 
of Gi, i.e. the lines which have one end free. Thus, for M0ller scattering only G4 contributes. For Compton 
scattering only G2 and G3. And for a transition from a one-electron atom to a one-electron state, which 
requires an operator of the form ^261, and G5 and Ge will contribute. 

A tp operator not only annihilates electrons but also creates positrons. And a ip not only creates electrons 
but annihilates positrons. Thus an undotted external line with the arrow pointing in represents either an 
electron in the initial state or a positron in the final state. And an undotted external line with the arrow 
pointing out represents either an outgoing electron or an incoming positron. Dotted external lines likewise 
represent a photon either in the initial or the final state, because can both annihilate and create a 
photon. Thus G4 will give not only electron-electron scattering, but also electron-positron scattering. G2 
and G3 give not only the Compton effect, but also two-quantum annihilation of a positron-electron pair, 
and also the inverse process of pair-creation by the collision of two photons. 

We have introduced the Feynman graphs simply as a convenient pictorial way of visualizing the analysis 
of an operator into its normal constituents. The graphs are just diagrams drawn on the paper. But 
according to Feynman^^ , "Space-time Approach to Quantum Electrodynamics", Phys. Rev. 76 (1949) 
769, the graphs are more than this. He regards the graphs as a picture of an actual process which is 
occurring physically in space-time. Thus G2 represents an electron and a photon coming together and 
interacting at the space-time point xi, where the photon is absorbed; then the electron propagates through 
space-time along the line X1X2, until at X2 it emits a photon, and the electron and photon then travel out 
along the outgoing lines at X2- According to Feynman an internal undotted line running from xi to X2 
represents an electron propagating from xi to X2 if the time X2 is later than the time xi, and represents 
a positron propagating from X2 to xi if the time X2 is earlier. In this sense a positron can be regarded as 
precisely equivalent to an electron which happens to propagate itself backwards instead of forwards in time. 

The space-time picture of Feynman is a perfectly consistent and reasonable one. It gives a correct 
account of everything that happens, including pair creation and annihilation and all the other phenomena 
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connected with positrons. It is in fact equivalent mathematically to the field-theory treatment we follow in 
this course. 

The disadvantage of the Fcynman theory is that it is constructed as a particle theory. The fact that there 
are many particles, indistinguishable from each other and obeying quantum statistics, has to be put into 
the theory as a special assumption. And the equations of motion of the particles become quite complicated 
when interactions between several particles, not to mention vacuum-polarization effects, are included. Thus 
the logical basis of the Feynman theory is much less simple than that of the field theory, where everything 
follows from general principles once the form of the Lagrangian is chosen. 

In this course we follow the pedestrian route of logical development, starting from the general principles 
of quantizations applied to covariant field equations, and deriving from these principles first the existence 
of particles and later the results of the Feynman theory. Feynman by the use of imagination and intuition 
was able to build a correct theory and get the right answers to problems much quicker than we can. It is 
safer and better for us to use the Feynman space-time pictures not as the basis for our calculations but 
only as a help in visualizing the formulae which we derive rigorously from the field-theory. In this way we 
have the advantages of the Feynman theory, its concreteness and its simplification of calculations, without 
its logical disadvantages. 

Feynman Rules of Calculation 

The Feynman rules of calculation arise when we analyze into normal constituents a chronologically ordered 

operator such as (488). In this case the vacuum expectation values in (504) are always taken for pairs of 
operators which are already chronologically ordered. Hence the numerical factors in (504) are all either 

{P{A^{x), A^{y))}^ = \ hcDp{x - y) 5^^ (510) 

or 

e{x - v) {P{i,^{x),i^fAy)))^ = -\ Sp^pix - y) (511) 

using (427) and (442). The factor e is put into (511) so that the ± sign still characterizes the permutation 
of electron-positron operators in going from the order as written in (504) to the order as written in O. For 
the same reason, we shall follow Wick and generally use for chronological products the notation 

T{nxn2...Tin) = ±P{n{R2...nn) (512) 

where the sign is plus or minus according to the even or odd character of the permutation of electron- 
positron operators involved in going from the written order to the chronological order in (512). Hence in 
particular we have 

T{A^{x),A^{y)} = P{A,^{x\A^{y)) 
T ( V„ (x) ,iip{y))=e{x-y)P (^^ (x) , (y)) 

and for every set of field operators 7?.i7?,2 . . .??,„ the quantity (512) is a relativistic invariant although the 
P-product by itself is not. In (488) itself the P-product may be written as a T-product, the sign in (512) 
in this case always being plus. 

The rules for writing down the normal constituents of (488) are therefore extremely simple. We are 
generally only interested in those normal constituents which give matrix elements for some specified type 
of scattering process. Then the rules are 

1) Draw all the graphs which have the right set of external lines corresponding to the particles absorbed 
and emitted in the process considered. Each graph G will have the same external lines, but the number 
of vertices and of internal lines will vary from graph to graph. We shall always calculate only up to some 
definite order N in the series (488), and so we draw only graphs with not more than A'' vertices. The total 
number of such graphs is finite. Each vertex in each graph must have precisely 3 lines ending at it, one 



(513) 
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incoming electron line, one outgoing electron line, and one photon line. 

2) To each graph G with n vertices corresponds one normal constituent Sq of S. 

3) Choosing a particular G, write down the n^^ term Sn of the series (488) and pair off the factors of Sn 
as indicated by G. Replace each factor-pair Ax{x)Afj_(y) by (510), and replace each factor-pair tpa{x)tpp(y) 
by (511). Apply an A^-ordering to the remaining unpaired factors of 5„, and multiply the whole expression 
by (±1) following the rule given for equation (504). The result of applying these operations to Sn is the 
normal constituent Se- 
lf we wish to calculate the matrix element for the scattering process, then we have only to add one more 

rule to the three already given. 

4) In each So, substitute for the unpaired operators the wave- functions of the absorbed and emitted 

particles, for example writing (437) for iJj{x) when an electron {p,u) is absorbed, and writing (438) for 
A^{x) when a photon (fc,e) is absorbed. These substitutions may sometimes be made in more than one 
way (for example in the Compton effect when the absorbed and emitted photon may be assigned in two 
ways to the two unpaired photon operators.) In such cases the substitutions are to be made in all possible 
ways, and the results added together, taking account of Fermi statistics by putting in a minus sign when 
two electron or positron wave-functions are interchanged. 

The rules (l)-(4) constitute the Feynman rules for calculating the matrix elements of all processes in 
electrodynamics. According to Feynman they have an intermediate concrete interpretation. Thus (510) is 
the probability amplitude for a photon emitted at x with polarization A to propagate itself and arrive at y 
with polarization /i, plus the amplitude for a photon having been emitted at y to arrive at x. And (511) 
is the amplitude for an electron emitted at y to arrive at x, plus the amplitude for a positron emitted at 
X to arrive at y, with the assigned spins a and 0. In this way the matrix ekuxicnt is just the probability 
amplitude for the succession of events, interactions and propagations, that are depicted in the vertices and 
lines of G. The total probability amplitude for a process is just the sum of the amplitudes derived by from 
the various graphs G which contribute to the process. 

The Feynman rules of calculation take their most practical form when we use the momentum represen- 
tations (430) and (448) for the Dp and Sp functions, carry out the integrations over the points xi . . .Xn, 
and so obtain the matrix elements as integrals of rational functions in momentum-space. In this way for 
example the simple matrix elements (432) and (449) were obtained. 

In the momentum space integral for Sq, there will appear 

(1) A factor corresponding to each internal photon line of G, (514) 

rC 

(2) A factor — corresponding to each internal electron line of G, (515) 

f — t/j, 

(3) A factor (27r)^5^(fci -1-^2-1- h) (516) 
corresponding to each internal photon line of G at which the 3 lines associated with 
momenta (fci, fc2, k^) meet. This factor arises from the integration over the space-time 
position of the vertex. 

In addition to these factors there will be numerical factors and Dirac matrices 7q, arising from the 
particular form of Sn- In practice it is easiest not to write down the Sq directly in momentum space, but 
to use the rules (1) - (4) to obtain formulae in configuration space with the right numerical constants, and 
then transform to momentum space by (430) and (448). 

We shall now show how these general methods work by calculating in detail the historic problem, 
the second-order radiative correction to the scattering of an electron by a weak external potential. This 
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problem has been first satisfactorily treated by SchwingerJ^ Phys. Rev. 76 (1949) 790. Schwinger's paper 
is outstandingly difficult to read, and I hope you will find my treatment at least slightly easier. But the 
problem is in its nature complicated and cannot be done without some fairly heavy mathematics. Once 
the calculations are done for this problem of scattering, it turns out that the results can be used without 
much further trouble for the relativistic calculation of the Lamb shift too. The scattering and Lamb shift 
problems arc very closely related: in both cases one is calculating the second-order radiative corrections to 
the motion of an electron, only in one case the electron is in a high continuum state so that the external 
field can be treated as weak, in the other case the electron is in a discrete state and the potential must be 
treated as strong. 



The Self-Energy of the Electron 

Before we can study the effect of radiation interaction on an electron scattered by an external potential, we 
must first consider the effect of the radiation interaction on a single free electron in the absence of external 
potentials. Let the free electron be given initially in the state (pu). The effect of the radiation interaction 
acting alone is given by the scattering matrix (488). If the initial state is $a, then the final state, reached 
after the radiation interaction has been acting for an infinitely long time, will be S^a- Now S has matrix 
elements only for transitions which conserve momentum and energy. Starting from a one-electron state, it 
is impossible to make a transition to a many-particle state, for example by emitting one or more photons, 
while conserving momentum and energy. Therefore the only non-zero matrix elements of S from the state 
<1>^ will be given by (487), where $b is also a one-electron state. In $s let the electron have the momentum 
and spin {p'u'). 

We consider radiative effects only up to the second order. The term of order 1 in (488) gives transitions 
only with emission an absorption of photons, and hence gives no contribution to the transition $a — > ^s. 
Therefore we may write simply 

S = l + U2 (517) 

with U2 given by (423). We have to calculate the matrix element M2 of U2 between the states {pu) and 
{p'u'). 

To write down M3 we use the Feynman rules. The factor-pairings of U2 are represented in the 8 graphs 
on pp 98-99. Of these only 6*5 and Gg contribute to M2, and they contribute equally since the integral 
(423) is symmetrical in the variables Xi and X2. The normal constituent of U2 arising from G5 and Ge is, 
using (510) and (511)^=* 

^2JV = 51 ^ jjdx^dX2N{^{xi)^x{T{'^{xi),i^{X2)))^l^'4^{X2)) {T{Ax{xi),A^{x2))), 

(518) 

= yy '^^l ^^^2 N{'tl){xi)^xSF{xi - X2)-iXll>{X2)) Df{X2 - Xi) 

To obtain M2 from (518) we substitute for 4'{x2) and ^^{xi) the wave-functions of the initial and final states, 
and use the momentum integrals (430), (448). Then the integration over x\ and X2 can be carried out and 
we find 

ie^ f f „ f_, 1 \ 1 

A 



M2 = ^— / / dkidk2 (u'ta-w —l\u]T^S{ki-k2-p')6{k2-ki+p) 

(519) 



A 

We consider the Dirac operator 



= E ^ ^(p - p') X * (^^^ ^TT^'^^") P 
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appearing in (519). Since {p,u) are the momentum and spin of a real electron, we may use the relations 

p"^ ^fx"^ = 0, {f - iii)u = (521) 

when wc evaluate E(p) in (519). So using (376), (585) and following the same method that was used in 
evaluating (377)^^ 







\ j dz j 
Jo Jf 



where we changed the origin of the k integration by the replacement k ^ k — zp and we eliminated the odd 
terms. Using (386) and introducing the logarithmic divergence R again according to (387), 



S(p) = 2 / dz + z){2m^{R - log z)} = -n^n [6R + 5] = -Gtt^ijR' 
Jo 



(523) 



Thus S(j3) is a logarithmically divergent constant, depending only on the electron mass and independent 
of the state of the electron. The difference 5/6 between R and R' is of course not significant. Substituting 
(523) into (519) gives the value of M2 

2 

M2 = -6ttH^R'S(p - p') iu'u) (524) 

he 

Thus U2 does not give any transitions between different one-electron states. It has only the diagonal matrix 

elements given by (524) . 

Now (524) has precisely the correct relativistic form to be identified with a pure self-energy effect. 
Suppose that in consequence of the radiation interaction the mass of a real electron is 

m = nio + 6ra (525) 

where mo is the mass of the bare electron without interaction and 5m is the electromagnetic contribution 
to the mass. The mass-change 5m would be represented by a term 

= -5m '^ip (526) 

in the Lagrangian density (410). This would produce an interaction energy 

Hs{t) = 5mc^ j ij{r, t)tp{r, t) d^r (527) 

in the Schrodinger equation (415), and finally a contribution 

Us = -i^ J lp^{x) dx (528) 

in the scattering matrix (421) or (488). 

The matrix element of (528) between the states (pu) and {p'u') is 

Ms = -i^{2Trf 5{p - p') {u'u) (529) 
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This is identical with (524) if we identify the self-mass Sm by the equation 

Sm = A 4^^' = ^R'm (530) 
87r2 he 2Tr ^ ' 

For all onc-clcctron matrix elements, U2 is identical with Us- That is to say, the whole effect of the radiation 
interaction upon a free electron is to change its mass by the amount (530). This is a most satisfactory 
conclusion. It means that an electron with its self-field still has the correct relationship between momentum 
and energy for a relativistic particle, only the value of the rest-mass being changed by the self-field. It was 
always one of the central difficulties of the classical electron theory, that a classical extended electron did 
not have the right relativistic behavior. 

The self-mass 6m is an unobservable quantity. The observed mass of an electron is to, and neither TOq 
nor Sm can be measured separately. Thus it is unsatisfactory that 6m appears in the scattering matrix S 
which is supposed to represent the results of experiments. 

The reason why 6m still appears explicitly is just that we have not used the observed mass to in defining 
the initial and final states of the system. We defined these states as states of a free electron with the bare 
mass Too- Wherever we used the letter to for the electron mass in the theory up to this point, in fact it was 
an inconsistency of notation and we meant by to the mass of a bare electron. 

It is much better not to change the notation, but to keep the notation and change the interpretation, so 
that to everywhere in the theory is now intended to mean the mass of a real electron. In particular, we set 
up the interaction representation operators with the real electron mass to, and the initial and final states of 
scattering problems are defined as free particles with the correct mass to. With this changed interpretation, 
the whole of the theory up to this point is correct, except that in which appears in the Lagrangian (410) 
of quantum electrodynamics, and in the field equations (411), (412) satisfied by the Heisenberg operators, 
the bare mass TOq must be used instead of to. We prefer to keep the observed mass to in Jfjj, and correct 
for it by writing instead of (410)^^ 

^ = ^n+^M- iei^M - ie^4> - -^s (531) 
with given by (526). The radiation interaction becomes then 

HR{t) - Hs{t) = H\t) (532) 

with Hr given by (416) and Hs by (527). After making the changes (537) and (532), the whole theory 
becomes consistent with the interpretation that to is everywhere the observed electron mass. 

In particular, one result of (532) is that for one-electron states the scattering operator S becomes 

S = l + U2-Us (533) 

instead of (517), keeping only terms of order e^. The matrix elements oi {U2 — U s) ior one-electron states 
are all zero. Thus, if we use the correct mass m in defining the states of an electron, there are no longer 
any observable effects of the radiation interaction on the m,otion of a free electron. This shows that the 
mass-renormalization, the procedure of inserting the term (— -S^s) in (531), is consistent and is likely to give 
sensible results. 



Second- Order Radiative Corrections to Scattering 

Let an electron be scattered from the initial state {pu) to the final state {p'u') by the external potential 

Al{x) = j^Je"'--e,{q)dq (534) 

At the same time the electron is interacting with the quantized Maxwell field with the interaction (532), 
since we suppose the initial and final states to be defined with the observed mass of a free electron. The 
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scattering matrix element M is then given by (419), with U given by (421) after replacing each Hn by 
according to (532). 

We treat in the linear Born approximation. Thus we keep only terms of order and 1 in The 
terms of order give the effects of the radiation interaction alone; as we have seen, these effects are zero 
for an initial state consisting of a single electron. 

The scattering matrix is thus given effectively by the terms of order 1 in A^ taken from (421), namely 



ra=0 



h 



^J---J dtdti...dtnP{H-{t),H\ti),...,H\tn)) 



(535) 



We shall calculate radiative effects only up to the second order in the radiation interaction. Since 5m is 
itself of second order, this means that we go to second order in Hr and to first order in Hs- Thus 



Uo 

U2 



he 



ie dm 



jj dxdxiP {ii)4^il}{x),il)Jiil){xi)} 
jjjdxMx2P{l^mx)M^ix.)M^ix2)) 

JJdxdx.pm'^ixmix.)} 



The matrix element we wish to calculate is then correspondingly 

M = Mo + Mi+ M2 + M2 
The wave-functions of initial and final states are 



ue'- 



ue'- 



Then by (534) we have 

where q is the constant vector 
and 



Mo = -^(u'^u) 



q = p -p 



(536) 
(537) 

(538) 

(539) 

(540) 

(541) 

(542) 

(543) 

(544) 

(545) 



The operator Ui gives transitions from a one-electron state only to states consisting of an electron and 
a photon. This is just the bremsstrahlung process, scattering of the electron with real photon emission, 
and the matrix clement for it is given by (484). In any scattering experiment, this process will of course 
go on at the same time as the scattering without radiation. Experimentally, the scattering with photon 
emission will only be; separable from the radiationless scattering if the emitted photon has an energy greater 
than some limit Ai?, roughly equal to the energy resolution with which the energy of the electron can be 
measured. The scattering with emission of soft quanta (low frequency, k' small) will always be included 
in the radiationless scattering cross-section. Therefore we shall be interested in the value of Mi for a final 
state consisting of the electron {p'u') and a photon with potentials (440), in the case where k' is so small 
as to be negligible in comparison with p, p' and q. In this case (484) gives 



Ml = 



p ■ e 



p -k' p' ■ k' 



(u'^u) = 



he 



p ■ e 



p' -e' 



p -k' p' ■ k' 



Mn 



(546) 
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where we used (521) and rule (4) on page 87. 

We now come to the calculation of M2, the matrix element of (539) between the states (542). This is the 
main part of the problem. To do it we use the Feynman rules. There are just 9 graphs giving contributions 
to M2, namely 







and {Ge, G7, Gs, Gg} obtained by interchanging the labels {xi,X2) in {Gi, G2, G3, G4}, respectively. We 
can see this in the following way: The process we are interested in calls for one external photon line and 
two external electron lines. Hence ^ (x), one and one must be unpaired. Therefore the two ^'s are 
always coupled. The free electron lines can be i/'oiV'i; V'ijV'o! V'liV'i; "01: "02 and four more cases obtained 
by the substitution 1 «-> 2. In each case, the rest is uniquely determined by the rules. The ninth case is the 
one with external electron lines ipQ^ipQ. 

The effect of {Gg, G7, Gs, Gg} is only to double the contribution from {Gi, G2, G3, G4}, since (539) is 
symmetrical in the variables xi and X2. Also G5 will give only a numerical phase factor multiplying Mq, 
the phase factor being the same for all final states; it is in fact precisely the phase-shift between the initial 
and final states due to the self-energy of the vacuum. Similar phase shift factors would also multiply the 
contributions of Gi, G2, etc., coming from disconnected graphs in higher order contributions. In this sense 
therefore, we can consider G5 to be really the contribution of Uq with one of the many possible disconnected 
additions. A numerical phase factor of this kind, the same for all final states, is entirely unobservable and 
without physical meaning, since it can be cancelled by changing the phase of all wave-functions by the same 
amount. Hence we can always ignore graphs such as G5 having a disconnected part without external lines. 
There remain to be considered only {Gi, G2, G3, G4}. 

Using the Feynman rules, the contribution of Gi to M2 is (with the factor 2 from Ge) 



M21 = 



dx dxi dx2 J2 Spur {4" (x) {T{^ix), ^(xi)})^ 7m (^^(2^1), x 

X ^{X2) {T{A^ixi), 4ix2)})^ 0(X2) 



(547) 



where the spur appears because of the contraction according to rule 3, and the minus sign comes from the 
change in order of ip and ip factors between (539) and (547). By (510) and (511) 



M21 - 



dx dxi dx2 ^ Spur {4-'^{x)Sf{x — xi)'^f_iSF{xi — x)} Dp{xi — X2)ip{x2)'yfitp{x2) 



Hence, using the momentum integrals (430), (448), (534), (542) and carrying out the integration over 
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{x,Xi,X2), 

= (2^^ //// '^^^ 5: Spur {^(9)|;^7.^^} ^ (u'^.u) X 

X + /ci - A:2) (5(-fci + + /cg) (5(-/c3 - + p) 

_ ie^ . 1 \ 1 , , (548) 



with g given by (544) and^® 



= / F^{k)dk (549) 



the function F^{k) being identical with (371) for 6 = 0. Note that (549) is a Feynman integral, which is 
precisely the same thing as the contour integral (374) with the contour drawn in the diagram. The effect 
of the e in (431) is just equivalent to the contour C. Hence''^ using (388) for J^, 



^21 = {u'^u) i^lR-j\z- Z-) log (l + (. - z-)^^ dzj (550) 



where we have also dropped the term in(388), since 

{u'fu) = {u'{f' - iiJ.)u} - {u'{p - iiJ.)u} = (551) 



Writing a = , ^ , (550) becomes 
4TThc 

M2i=aMol^-^R+^ {z-z^)\og(l + {z-z^)^^dz^ (552) 

This is just the scattering which is produced by the charge-current density induced in the vacuum by the 
potential A^ according to (392). As before, the term in R is unobservable since it can never be separated 
experimentally from the simple scattering Mq to which it is proportional. The observed external potential, 
measured in any way whatever, will not be A^ but A^(l — ^R), which we may call the "renormalized 
external potential" . Hence in terms of the observed A^ the total contribution from Gi will be 



M21 = —Mo [\z - z^)\og (l + {z- z^)^ ] dz (553) 

Jo V M 



The integral will be in general complex as before. But for small q it will be real, and neglecting terms of 
order higher than g^, we have 
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Next we consider the contribution to M2 from G2. This is 



M22 = TTTT- 



■ J J J dx dxi dx2 ^2 '4'{x)4-^{x)Sf{x - X2)^\Sf{x2 - xi)'y\ii{xi)DF{xi - X2) (555) 

^g3 /• /• /• r ( ^ ill 

dki dk2 dks dq < u'^{q) -g r 



{2w)^h^c^ JJJJ '^'^ "'"^ "'"^ I - ^v-^y _ 7A _ 7A« I ^2 X 

X +q-p') S{k2 - fci - fca) 5(A;3 + p - fca) (556) 

»3 



I dk [u'^—^lx . 7am) (557) 



A 

ie^ / 1 



u'^^^ E{p) u ] (558) 



(27r)4/i2c2 V - i/i 

with S(p) given by (520). 

Before discussing E(p) we must look at the factor - — — which appears in (558). This factor is 

But since p is the momentum vector of a real electron, + = and the factor (559) is singular. 
This means that the integrals over xi and X2 are really divergent and not merely finitely oscillating at 
t = ±00, and the transformation into momentum integrals is not allowable. Eq. (558) as it stands is 
strictly meaningless. 

This is the place where we have to take explicitly into our calciilations the slowly- varying cut-off functions 
g{ti) appearing in (495), which are put in for the purpose of defining unambiguously the initial and final 
states of the problem. So we write instead of (555) 

M22 = g^2g2 jjj dxdxidx2 'Ytp{x)4'^{x)SF{x-X2)j\SF{x2-xi)-fxip{xi)DF{xi-X2)g{ti)g{t2) (560) 

Here the g{ti)g{t2) factors are attached to the radiation interaction operating at xi and X2- It is supposed 
that the time T over which g{t) varies appreciably is long compared with the duration of the scattering 
process. Let the Fourier integral representation of g{t) be 

/oo 
G(eo)e-'^''^*deo 

(561) 

G(eo)e*^-^ deo 



/ 

J — < 



where eo is a real variable and e is the vector 

e=(0,0,0,eo) (562) 

We have the normalization 

/oo 
G(eo) deo = 1 (563) 
-00 

and we suppose that G(eo) is "almost" a (5-function, that is to say a function which is large only for values 
of eo in a range of about (cT)~^ on either side of zero. Substituting (561) into (560), we obtain instead of 
(558) the corrected formula 

= " (2^^PW / [ ^^'0^ "^'^ "^'0 I «^(9 - e - e') J^^J^jr—^iP + e) w i (564) 
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In (564) the inadmissible factor (559) is replaced by something finite and mathematically well-defined. There 
will be a singularity in the integration of (564) over eo, but this is an ordinary pole and the integration over 
Co will give a well-determined result when taken as a Feynman integral. We have as T — > oo and eo and 

1 ^ f + ^ + + ^ f + in 

f + i + i'-ili 2p-(e + e') + (e + e')2 " 2po(eo + eo) 

Hence when evaluating S(p + e) wc need retain only terms of order zero and one in eg; the terms of order 
two and higher are negligible, because even when multiplied by (565) they tend to zero as T — » oo. 
Keeping only terms of order zero and one in e, E(p + e) becomes 

S(p + e) = S(p) - ^Mp) (566) 

a 



/« (p) = j^dkY, ( 7A „ , ] ,.. 7a „ , ] .... 7A 1 1^ (567) 



Here we have used the identity 



A+B A A A A A A 



which is valid for any two operators A and B, commuting or not, if the series on the RHS converges in some 
sense. This can be seen at once by multiplying through by A + B; then the condition becomes {B/Af" — > 
in some sense. 

In (564) we may use the conditions (521), and these give for S(p) the constant value (523). The integral 
Ia{p) is like S(p) logarithmically divergent for large k, and it is also logarithmically divergent for small 
fc, which S(p) is not. We shall not attempt to evaluate Ia{p) mathematically. From general principles of 
covariance we can say what is its form as a function of p. For general p not satisfying (521), Ia{p) is a Dirac 
matrix transforming like a vector under Lorentz transformations, and therefore must be of the form 

laij)) = -Fl(p^)7a + -F2(P^)(|> - ilAloc + ^3(p^)7a(|^ " «M) + i^4(p^)(|^ - «M)7a(|> - «M) (569) 

where Fi, . . . , F4 are functions of the scalar p^. Hence using (521) and (523), we see that in (564) we may 
put 

S(p + e) = -67r2/ii?' - - I^ijf) - in)^ (570) 
where Ii and I2 are new absolute constants, and in particular 

/i=i^i(-/x2) (571) 

But in (564) the term 

is of order e and tends to zero as T — > 0. This term may be dropped, and then (564) becomes''^ 

(572) 



\u^{q-e- e') ^ ^ , ^ . (-677 W - h!^)u\ 

Note that if Ia{p) given by (567) were to be evaluated assuming that + = and f — iii = Q operating 
both to the left and to the right, instead of only to the right as in (521), the result obtained would be just 

/a(p)=/l7a (573) 
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This is a convenient definition of Ii for future reference. 

Now it is clear that the term R' in M22 represents some kind of effect of the electron self-energy, which 
ought not to be observable. We may expect that this term will be cancelled by the term Afj arising from 
the self-energy correction Hs in (532). This is all the more plausible, because the graph G2 on page 106 
contains as a part the graph G^ on page 99 which represents the self-energy of a free electron. We now turn 
to the calculation of Mj. 

M2 is the sum of two contributions arising from the two graphs shown below. 




From G2 the contribution is (compare with (528)) 

= — // dx dxi'ij:{x)4^'' Sf{,x - xi)il}{xi) (574) 

Like (555) this integral does not oscillate but diverges at ti = ±00. Therefore we must take explicitly 
into account the cut-off factor multiplying the radiation interaction. At time ti the radiation interaction 
HR{ti) will carry the cut-off factor g{ti). But the self-energy Sm at time ti is an effect of second order 
in Hj^, and therefore becomes multiplied by [^(ti)] if 5(^1) varies slowly enough. In the definition of the 
scattering matrix element (487), the cut-off factors g(t) were introduced in order to represent the initial 
and final states by simple bare-particle wave- functions in an unambiguous way. We now require that the 
bare-particles' wave-functions should always have the same mass m as a real electron. This is achieved if we 
add the term {—Hs) to the radiation interaction appearing in (495), each Hs(ti) being multiplied by the 
cut-off factor [^(ti)] so long as we are dealing only with second order terms in 6m. If we were calculating 
effects to fourth order in e, then the fourth order part of Sm should be multiplied by [g(ii)] ; on. 
The effect of the cut-off factors is therefore to replace (574) by 

^^22 = ~ 2fi2 jj dxdxi'4!{x)jL' Sf{x - xi)'ip{xi)[g{ti)] (575) 

Using (561) and performing the integrations as before, this becomes 

M^2 = // G(eo)G(e[,) deo de', |u7(g -e-e') (^—^1^^—^^ (576) 

In virtue of (530), this term does precisely cancel the term in R' in (572), as was to be expected. 

To simplify the term in Ii in (572), we may replace ^ by i(^ -t- f') since the integrand is otherwise 
symmetrical between e and e'. And using (521) we may replace this in turn by ^ (^ -I- ^ -I- ^' — i/i). This 
cancels the denominator of (572) precisely. After the denominator is cancelled, the expression is non- 
singular and we may go to the limit T — > 00, using (563) to perform the integration over eg and Cq. Since 
it is assumed that the external potential is of a limited duration not tending to infinity with T, the factor 
^(g — e — e') is a continuous function of e + e' and tends to ^{q) as T ^ 00 7'^ Hence in the limit T — )■ 00 
we have 

1 tOL 

+ ^^22 = ^^2 2 - ^1 (577) 
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The graphs G3 and Gg give an exactly equal contribution. Hence 

icx 

M22 + M23 + + ^23 = ^ h Mo (578) 

It turns out that Ji is pure imaginary, and the factor multiplying Mq in (578) is real and negative. 

What is the physical interpretation of the divergent term (578)? It is just a divergent constant multiple of 
Mo, like the charge-renormalization term in (552). So one is tempted at first to call it an additional charge- 
renormalization effect. But this cannot be right, because the whole charge-renormalization was calculated 
in (392) and the result agreed with (552). In fact (578) has a much more elementary interpretation. When 
the electron arrives at the point x of the external potential where it undergoes scattering, there will be a 
certain probability P that it will have previously emitted and not yet reabsorbed a photon, as pictured in 
the Feynman graph G4. There will be a probability (1 — P) that it will arrive at x not accompanied by a 
photon, as pictured in G2 or G3. 

Consider then the contribution Mjvp to the matrix element M, produced by scattering processes in 
which the electron arrives at x with no photon present. In the zero-order approximation Mjvp = Mq 
simply. But in the second-order approximation we must take account of the reduced probability that the 
electron comes to x without a photon present; this is done by multiplying the wave- function of the electron 
in both initial and final states by the factor 

(1 - P)i/2 (579) 

Therefore in second-order approximation 

Mjvp = (1 - P)Mo (580) 

Since in second order 

Mnp = Mo + M22 + M!22 + M2-i + (581) 
the result (578) agrees with (580) provided that we identify P with 

P = -^7r (582) 

The factor (579) represents a renormalization of the amplitude of the wave-function, and for this reason 
(578) is usually called a "wave-function renormalization" effect. But this does not mean that the term 
(578) is to be removed by a process analogOTis to mass or charge renormalization. No difficulties arise if we 
simply retain (578) as it stands. Finally it will cancel out against a term {-\-PMo) which comes from G4. 

The contribution from G4 to M is, with a factor 2 to allow for Gg, using the Feynman rules, 

M24 = g^2g2 jj dx dxi dx2y^^tp{x)^xSF{xi - x)4''{x)Sf{x - X2)JX^{x2)Df{x2 -xi) 

^ (583) 
{uk{p,p)u) 



where 

A(„', = /^.ifci:(7.y^^|^7.)^ (584) 
There is no singular factor in (584) such as we had in (558). To perform the sum over A in (584), we use 
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the table 

A 

^ (585) 
X] 'yx0'yx = 4(a • b) 

X 

Y,ix0^ix = -'^i^H 

X 

valid for any vectors a, b, c. These formulae can be deduced from the following recursive formula: 
Denote j^^^^ = ^^^^ ...^^ where are arbitrary vectors, and Xn = 7a^(„)7a, Xo = 4. Then we have 

Xn+l = l]7A^(„)^„+i7A = X]Zl'T^^(„)'>A«7A(5n+l)/^ = Y.Y,^>^^{n) P*^^" " 7a7a.] (wOa* 
A A A /i 

which then gives (585) for n = 1, 2, 3. Thus 

A(V v') --2( dk {^+fW^f)-^^^^{^<'-k + e-P + e-p')-t,-^ 

A[p,p)- IJ^dk k^k^+2k-p'][k^ + 2k-p] ^^^^^ 

In (583) we may use the relations 

+ fi^ = p'^ + = {f- in)u = u'{f' - i/i) = (587) 

We also assume that the external potential satisfies the Lorentz condition 

^ =0 so that e-q = Q (588) 



— = 2/ / xdy^-^ ■ -3 (589) 



To evaluate (586) we use the 3-variable generalization of (376) 

1 

abc ~ Jq ""~ Jq ~ [a(i - x) + bxy + cx{l - y)f 
which one can verify at once by direct integration. We write 
Pv =py + p'{l - v) 

pI = [-{P' - P)y + Pf = Q^y^ - (2p" - 2p • p')y -12^ = q^y^ - (p'^ -2p-p'+ p") - /x^ (590) 

2 / 2\ 2 

= -H -{y-y )q 

Then changing the origin of the fc-integration by the substitution k ^ k — xpy (586) and (589) give 

rr r - xp +p)i0-xp -\-f)-2iij.e-{2k-2xpy+p + p')-iJ?^ 
A(p,p') = -4 xdxdy dk -^^2 T^T^ ^^^^^ 

J J J F \JZ X Py\ 

In (591) we may drop terms which are odd in k. Also using (587) and (588) we may put 

e-p = e-p' = e-py = + (592) 
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- xf^i-{i - xfy) = {(1 - a;)i/i - (1 - xy)(f] if {(1 - x)iix + (1 - a; + xy)^] 

= -(1 - + (1 - x)itJi^{2 - a;) + (1 - xy){l -x + xy)q^^ 

Collecting terms from (592), (593), we have 

+ (1 - xy){l — x + xy)q^j, — (x - 



A(p,p') = -4 ^ xdxdy J 



dk 



(2-2a;-a;2)^2. 



(593) 



(594) 



Now we saw earlier that the integral (567), evaluated using the conditions + fj,^ = 0,p — ifj, = 0, has 
the value (573). Comparing (567) with (584), this implies that 



A{p,p) = A{p',p') = Ii^ 



(595) 



when the conditions (587) with p' = p are assumed to hold. Thus (h^) is just the value of (594) for p' = p. 
Adding together (583) and (578), this gives 



M2T = M24 + M22 + + M23 + = 
Ae(p,p') = A{pp') - \ {A(p,p) + A(p',p')} 

= -4: jj xdxdy I J dk [^0 + {2-2x- x^)p?i\ 



{u'Acip,p')u) 



(596) 



1 



1 



+ [(1 - xy){l - x + xy)q^^ - {x ■ 



L 



[fc2 + x2 (/x2 + (y _ y2) g2)]3 |;,2 + a;2^2j3^ 
1 1 



dk 

F [A;2 + a;2 {jj? + {y - y"^) q^) 



(597) 

The fc-integrals in (597) are now convergent. Thus the effect of the "wave-function renormalization" term 

(578) is just to cancel out the part of M24 which is independent of q and divergent at high frequencies. 

To evaluate (597) we use (385) and (386). First, in the term we may replace k^^k^ by jd^^k"^ because 
of the symmetry of the integral in fc-space. So we may write, using (585) 



^k^f 



(598) 



Then by (385) and (386) 



dkk' 



+ Af [fc2 + Af 
1 



/ dkk'^l ^ 

Jf \ [fc2 + 

-L 

and (597) becomes 

A^ipp') = 2.\jj xdxdy [-^Xo, 



A 



A' 



[fc2 + A]' [fc2 + AT [fc2 + A]'' [fc2 + Af 



2., A' 

TT llOg — 



(599) 



+ 



{x-l + 2{y- y2)(l -X- x'^)}q^if + {x - x' 



a;2(/i2 + (y - y2)^2 



Integrating the logarithmic term by parts with respect to y, 



A,{pp')=-27T^l 



JO 



dx dy 



1 



X [^2 + {y- 2/2) gr2] 



{ [(1 -x){l-2y + 2y2) + x^y] q^^ - {x - x^)ifi^^} 



(600) 
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When < — the external potential is able to create real pairs, and the denominator in (600) has 
poles in the range of the ^/-integration. In this case the Feynman rule of adding a term (—ie) to /U^, where e 
is an infinitesimal positive real number, will give an unambiguous determination of the integral. Just as in 
the case of the vacuum polarization formula (389), the integral will split into a real part and an imaginary 
part describing the effects of the real pairs created. We shall not discuss these effects of real pairs since 
they are not practically important. So we assume > — 

In (600) there are no longer any divergences arising from large k. But (584) has a logarithmic divergence 
at small k which appears in (600) as a divergence in the a;-integration arising from the factor (l/x). This 
last remaining divergence must now be examined in detail. It is the famous "Infra- Red Catastrophe" . 

To discover the physical meaning of the x-divergence, we consider what would be the effect on our 
calculations if the Maxwell field were somehow modified so that all field oscillations with wave-numbers 
satisfying 

|fe|>r (601) 

were present as usual, while all oscillations not satisfying (601) were simply absent or incapable of being 
excited. We suppose r to be a constant small compared with m, p, p' and q. Thus photons will exist only 
if their energies exceed 

AE = her (602) 

In the modified Maxwell theory the Dp function will still be given by the integral (431), the ^1,^2,^3 
integrations being limited by (601), and the fcp integration being taken as usual along the whole real axis 
from —00 to +00. Let A^{p,p') and A^{p,p') be the integrals which replace K{p,p') and Ac(p,p') when the 
Maxwell field is modified. We calculate the differences (A — A' ) and (Ac — A^), considering these integrals 
only in the limit of small r, neglecting all terms which tend to zero with r. This means that we may neglect 
terms containing either fc or x as a factor in the numerator of integrals such as (591) or (594). 

In (583) there is just one factor Dp- Thus A^{p,p') is obtained from (584) simply by restricting the 
ki,k2,k3 integration by (601). We can now follow the reduction of (584) to the form (594), except that 
wc do not shift the origin of the /c-intcgration through (xpy) since this would disturb the condition (601). 
Dropping terms in the numerator having /c or a; as a factor, this gives the result 

K{p,p') - A^{p,p') = -Ajjxdxdy (g2 + 2/^2)^ ^ ^1,2 ^^Ik ■ f ^^^^^ 



Hence by (597) 

= —4 jj xdxdy^ [ dk 



q^ + 2^i^ ^2 ^2 1 (604) 



F [[k'^ + 2xk-pyY [k^ + 2xk-p]-' [k^ + 2xk-pf} 

The integral (604), with the integration extended over the whole /c-space, would give, using (385), (587), 
and the substitutions k ^ k — xpy, k k — xp, k xp' , respectively in the three integrals, 

which is just the divergent part of (600). But the integration in (604) actually extends over k not satisfying 
(601). Therefore subtracting (604) from (600) and using (605), we find for Al{p,p') the final result 



K{P,P') = f dy I {(-1 + §2/ - 22/2) - 

-JjxdxdyJ dk[ ^'+'^' 3 ^- 3 3' 

J J Jf [[k'^ + 2xk-pyf [k'^ + 2xk-pf [k'^ + 2xk-pf 



(606) 
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This integral is completely convergent for every finite r, the /c-integration being restricted to k satisfying 
(601). The formula (606) is exact except for terms which tend to zero with r. 

To evaluate the k integral in (606) for general p and p' is possible but tedious. So we shall do it only in 
the case of non-relativistic velocities, when 

IpKa^, Ip'Km, I^Km (607) 



where \p\ means y/pf + p% +P3, the magnitude of the space-like part of the 4- vector p. In addition to (607) 
we still assume r small compared to q, p, p' . 
We consider the integral 

K = I [ xdxdy [ dk 3 (608) 

Jo Jo Jf [k^ + 2xk ■ py] 

integrated over k satisfying (601), and evaluate it including terms of order q^, but neglecting 

higher terms. Integrating over ko only, we have for any positive b 

f 1 dk 1 

*° [k^ + 2ako + b] = ' [\k\^ + ki + 2iako + b] = ^\k\^ + + b ^^"^^ 

Differentiating (609) twice with respect to 6, 

^'^{|fe|2 + a2 + 6}"''' (610) 



dko SiTT f I , ,2 2 , u^ "5/2 



If [A;2 + 2ako + b^ 
Hence^o 

K= [fxdxdy^ [ d'k{\k + xp^\'+x^^,' + {y-y')q')}-'^' 

JJ ° J\k\>r 

= IJxdxdy^J^^^^ d'k{{\k\^ + x'^,Y'/'-l{2xk-py+x'\Py\'+x'iy-y')q') x 

(|fe|2 + x2/.2)-7/2 ^ ^ . p^)2 (|fc|2 ^ ^2^2^-9/2| (611) 

= jjxdxdy^-^p e dk { {e + xV') - I xMlP.P + (y - y') {k'' + xV') 

+ fx^k^\pf{k^+x' 
We can now carry out the integrations over x and y, using 



1 

|pj2rfy = p-p' + ig2 (612) 

/o 

This gives 

3^^2 /.oo r 1 / 1 1 \ ^ . W _L i„2 



13^2 Vfc^ (fc2 + /x2)3/2y 3/^4 Vfc^ (fc2 + ^2)3/2 

" 2/x2(fc2 + ^2)5/2 ) + [P • P' + I?'] 7 ( fc^ - (A:2 + ^2)5/2 (^13) 

_ 5 1 

~^(fc2+ ^2)7/2^ 

The fc-integration is now elementary, and after dropping terms which tend to zero with r we have 



K = 



(2 log 



(614) 
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Putting p = p', q = in (614), we find 

= I' [.d.dy I dk p^^^ = '-^{k (log I: + l) + (615) 
Substituting (614) and (615) into (606) and dropping terms of higher order than \p\^, and q^, 
KM = -2.". {-4^^ - i '-tt] - if^ {i,' (logii + l) - 5^,' (2106 1: + !)} 



(616) 



Using (596) and adding the contribution (554) from Gi, we find for the second-order terms in (541) the 



value 

M, + m; = -ii {log ii + « - i} ^1 M„ + m 

The Treatment of Low- Frequency Photons. The Infra-Red Catastrophe. 

The second-order correction (617) to the scattering matrix element Mq has been made convergent by only 
taking into account the effects of photons with energy greater than AE according to (603). When AE 
the correction diverges logarithmically, and this divergence must now be interpreted. 
In the approximation where |p|^ and |p'p are small compared with (546) gives 

'^' = SSI ''''' 

The total probability that an electron is scattered between the initial and final states (542), with the 
emission of a photon with potentials (440), summed over all photons with frequencies lying in the range^^ 



n < \k'\ < r2 (619) 



is therefore 



Wnin,r.) = ld^k'^^[Jl^^m 



e2 



Stt V ri) /i2 



(620) 



|2 



In (620) it is assumed that both r2 and ri are frequencies small compared with |q|. 

On the other hand, the probability that an electron is scattered between the states (542) without 
emitting a photon is given by 

Wn = |Mo + M2 + M^|2 

= |Mop + {M2 + M'^) + (Ms + M^)* Mo ^^^"^^ 

neglecting terms of fourth order in the radiation interaction. If we consider in (621) the contribution only 
from virtual photons with frequencies in the range (619), then by (617) we have 

M2 + M^ = -^(log^)4l^oP 



GENERAL THEORY OF FREE PARTICLE SCATTERING 



117 



So the contributions to (617) from low-frequency virtual photons serve only to compensate exactly the 
probability for scattering with emission of low- frequency real photons. The non-radiative probability is 
decreased by the effects of low- frequency virtual photons, so that the total scattering probability, radiative 
plus non-radiative, is essentially independent of the presence of very low-frequency photons. The total 
scattering probability is thus a finite quantity free from any infra-red divergence. 

To describe correctly the radiative corrections to scattering it is essential to specify the critical energy 
AE below which real photons will not be detected. Ideally we suppose that in each scattering event a 
photon of energy greater than AE is detected with 100% efficiency, a photon of energy less than AE with 
zero efficiency. Then the total observed probability for non-radiative scattering is given by (621), with 
M2 + given by (617) and 



This probability (621) will include the scatterings in which a photon is emitted with energy below the limit 
of detection. The formula (617) is valid so long as 



The probability for radiative scattering (i.e. scattering with emission of a detectable photon) is given by 



It can be proved that this removal of the infra-red divergence by taking into account the existence of 
unobservable photons is quite general, and works equally well when q is not small. Only then the evaluation 
of (608) is considerably more unpleasant. Also, the same argument removes all infra-red divergences, also 
when higher-order radiative corrections are considered, in which case we are concerned with the effects of 
the emission of two or more soft photons during the scattering process. For a general discussion of this 
question see Bloch and Nordsieck, Phys. Rev. 52 (1937) 54. 




(623) 



r < |p|,|?>'|,|g| < M 



(624) 



(546). 



Scattering By A Static Potential. Comparison With Experimental Results. 



Consider the scattering of an electron by a time-independent electrostatic potential 

V{r) = [ d^qV{q)e''i'- (625) 



(27r)3 , 

Then (543) gives the matrix element for scattering without radiative corrections (compare (625) and (534)) 

Mo = 2m^{u'*u)V{q) 5{qo) (626) 

he 

Non-radiative scattering occurs only between states for which 

90 = 0, |p| = |p'| (627) 

The cross-section for scattering between the states (542), per element of solid angle d£t in the direction of 
p' , is without radiative corrections 

a^=(£^)^\u'*u\''\V{q)\''d^ (628) 

This follows immediately from (626) using (627), using the prescription derived on page 29, when we treated 
the M0ller scattering the first time. The procedure is as follows:^^ 

pdE= —— -—^ EdE = h^c^p dp d^p = p^dp dfl 

hi (27r) 

wV mc^ c^hp 



(7 = 



; dfi \u'*u\^ \V{q)\^ = dn \u'*uf \V{q)\' 



(27r)2fi3 E d^hp ' ' ' 
Summing over the final spin states, and averaging over the initial spin states, we have 

5 13 13 = 57^Spur{(|^ + i/.)74(/ + i/x)74} 

= ^{m'+PoPo+P-p'} = ^{2p^-i|g|'} (629) 
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where is the angle between p and p' , and 



/3=^ = ^ (630) 
Po c 



where v is the velocity of the incident electron. Hence the cross-section for an unpolarized electron beam 



IS 

2 



where E is the energy of the incident electron. 

The second-order radiative correction to Mq is given by (617), which in this case becomes 

M, + = {log ii + ii-i}^Mo-|^ {u'*iu)V{q) 6{qo) (632) 

This gives a second order correction to the cross-section cto according to (621). The total non-radiative 
cross-section, for scattering without the emission of a photon of energy greater than AE, becomes 



CTJV = CTq -I- (J2N 

Summing and averaging over the spin-states 

5 Y.^n'*fu){u*u') = Spm{{f + iM)74(/ + *m)74^} 

mm' \ t^l 

" ~ 8^ ^^^"^ ^^'^ (l*T474^ + 74/74^) } = Spur 



(633) 



Spur {i/x(/ - f){f' -f)] = Y^ <f 



Hence for an unpolarized electron beam the non-radiative cross-section is 



(634) 



ON 



(-i(.».i^+»-i)^)-o-(^ri''(rf^"S$ (-) 

Since we are working only to order in the radiative corrections, the second term in (635) may be replaced 

by 

-^^-0 (636) 

and then we find 

<'» = (l-^6ogi?i + i-i)«!V. (637) 



37r V 2A£ 6 5y ^2 

Formulae (628) and (631) arc exact for electrons of any energy; while (632) and (637) are valid only for 
slow electrons, terms of order higher than aq^ being neglected. 

To remove the dependence of (637) on r we must consider the cross-section for scattering with emission 
of a photon of frequency greater than r. Since the electron is now considered to be slow, the maximum 
possible energy of the photon is 

hck^^^ = S - mc^ « —IpI 2 (638) 
2m 

and therefore for all possible photons the momentum ti\k'\ will be very small compared with the electron 
momentum Thus the recoil of the electron, arising from the momentum carried off by the photon. 
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can always be neglected. The matrix element and probability for radiative scattering will be given by (618) 
and (620), even when the photon takes away a large fraction of the kinetic energy of the electron. 

We imagine a scattering experiment in which only the direction of the emerging electron is measured 
and not its energy. Then the radiative cross-section 'an will measure the total probability of scattering the 
electron into a solid angle dO. with emission of a photon having any frequency between the lower limit r 
and the upper limit fcmax given by (638). The observed cross-section will be 

(Tt = CTjv + '^R (639) 

with the same low-frequency cut-off r in both ctat and ur. Thus gt gives the cross-section for scattering into 
a given solid angle dfi with or without photon emission. Being directly observed, gt must be divergence-free 

and independent of r. 

In the radiative scattering process wc may take the final momentum of the electron to be \hp' where 
< A < 1 and p' satisfies (627). Instead of (627) the conservation of energy now gives by (638) 

n\p\^{l - A^) = 2mc\k'\ (640) 

According to (620), the probability for scattering the electron into a state Xhp' , with emission of a photon 
in any direction with frequency in the range (fc', k' + dk') is 

,,,, 2q; dfc' In - An'P , , , ,o 
--(^)=3;^^^^l^°l ^'''^ 

This corresponds to a differential cross-section 

for scattering into the solid angle d^l, neglecting now the term in (3 which appeared in (631) since (642) is 
itself of order a(3^ and higher terms are neglected. The factor A comes in from According to (640) 

dk' 2XdX 

Hence the radiative cross-section integrated over the quantum frequency is 
where by (640) and (638) 



T is the initial kinetic energy of the electron given by (638). 

Now we can combine (637) and (644) to give by (639) and (629) 



2a 

3^ V""° 2r ■ « 5 / 2 



= ( 1 - ^ ( log ^ + I - i ) 4/?2 sin' "t, ) ^0 



1 (646) 

Here we used the following trick: the integral over A blows up at A = 1. Therefore we subtract from the 
numerator its value at A — 1 , which makes the integral behave decently and permits us to change the upper 
limit from A^ to 1, for small A£^'s. We also have to add then the integrand with the numerator having 
A = 1; this gives then a logarithmic term which combines with (637) to give the first part of (646). 
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Formula (646) gives a result of the form 




(647) 



where for low velocities f{9) is independent of T and of order 1 compared to the logarithm. For any special 
potential f{9) can be calculated. 

From (647) we see that the observable radiative correction is not of order a but of order 



which is much smaller if v is not rclativistic. Thus the correction cannot he observed at all in a non- 
relativistic scattering experiment. In the relativistic region the effect is actually of order a as indicated by 
(647), but the correct formula is then enormously more complicated. 

The exact formulae in both non-relativistic and relativistic cases have been published by J. Schwinger, 
Phys. Rev. 76 (1949) 790. 

An experimental test in the relativistic range is just on the limits of the possible. See Lyman, Hanson 
and Scott, Phys. Rev. 84 (1951) 626. Scattering of 15 MeV electrons by nuclei were measured with very 
good energy-resolution, /S.E/E being 1 - 3 %. In this case it was only the non-radiative cross-section ctat 
that was observed, and so the radiative correction given by the relativistic form of (637) becomes quite 
large. In fact the radiative correction in gn in the relativistic range becomes of the order 



according to Schwinger, Phys. Rev. 76, 813, Eq. (2.105) (with misprint K for fc, in my notation /i.) In 
the conditions of the experiment of Lyman-Hanson-Scott (649) becomes of the order 5% and was clearly 
observed, the experimental errors being ^ 2%. However, (649) arises mainly from the low-energy virtual 
photons, with energies going down to IS.E. What is observed is just the decrease in the non-radiative 
cross-section due to the competition of radiative scattering with energy- loss in the range [/S.E,E\. Thus 
the measurement of (649) by LHS is actually only a very inaccurate measurement of the cross-section for 
bremsstrahlung, which could be observed much more accurately by observing the photons which are really 
emitted. 

The theoretically interesting part of the radiative corrections is the part which is not just the effect 
of real bremsstrahlung. This part is given by the terms in (Tjv which are of order 1 compared with the 
\og{/S.E/E) appearing in (649). For example, we would have to observe with sufficient accuracy to see the 
terms (| - i) in (637) if we wished to verify the theoretical radiative corrections at low velocities. In the 
relativistic range the "genuine" radiative corrections are of order 



instead of (649), i.e. about 2% in the LHS experiment. To detect such effects is already possible, but to 
observe them accurately in a scattering experiment seems not very hopeful. 

This is all we can say at present about radiative corrections to scattering by an electrostatic potential. 

A. The Magnetic Moment of the Electron 

The scattering by an electrostatic potential, the two terms in (617) were lumped together. Both gave 
contributions of the same order of magnitude, a{q^ / ji^) in the cross section. What then is the meaning of 
the special form of the second term in (617)? This term has no infra-red divergence and therefore should 
be particularly simple to interpret experimentally. 




(648) 




(649) 




(650) 
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Consider scattering of a slow electron by a slowly-varying^^ magnetic field. The potentials (534) can 
then be taken to be a pure vector potential, so that 

64(9) = (651) 

The matrix elements of 71, 72, 73 between positive energy electron states are of the order {v/c). Hence Mq 
given by (543) is of order {v/c). The first term in (617) is thus of the order a{v/c)^ while the second is of 
order a{v/c). Therefore the second term in (617) is the main term in considering magnetic effects, and the 
first term can be ignored. The meaning of the second term must be a change in the magnetAc properties of 
a non-relativistic electron. 

As we saw in discussing the Dirac equation, (Eq's (99) and (100)), an electron by virtue of its charge 
(— e) behaves in non-relativistic approximation as if it had a magnetic moment 

M = -— (652) 
2mc ^ ^ 

This moment has an energy of interaction with an external Maxwell field {E, H) given by 

Hm = -M{cr -H-ia-E) (653) 

the term which appears in the non-relativistic Schrodinger equation (100). 

Now suppose that the electron possesses an additional magnetic moment 5M which docs not arise from 
its charge. Such an additional moment is called "anomalous" . To give the electron an anomalous moment, 
we only need to add arbitrarily a term proportional to (653) to the Hamiltonian. Comparing (654) with 
(97) and (98), we see that (653) is a relativistic invariant and can be written 

Hence an anomalous magnetic moment 5M will be given to the electron if the term 

Lm = -\i5MY,Y. ^M-^M- (655) 

is added to the Lagrangian. This refers still to the one-electron Dirac equation. In the theory of the 
quantized Dirac field, the corresponding addition to the Lagrangian density (410) is 

where it is supposed that the anomalous moment is interacting with the external Maxwell field. The 
addition (656) to the Lagrangian^^ gives a relativistically invariant description of an anomalous moment. 

Consider the effect of (656) on the scattering of an electron by the potentials (534). Treating the 
scattering in Born approximation and using (420), the contribution of (656) to the scattering matrix element 
is 

^^ = Y.^ HxWMx)F;1.{x) dx (657) 

the integral being over all space-time. Using (542) for the initial and final electron wave-functions, and 
defining q, e by (544) and (545), this matrix element becomes 

= Z^(" (^uvUjilneu - gi/C^) = I— [u {0 - ^0u\ (658) 

where we used 7fe7^ = idm, k,£,m = (1,2,3) cyclically permuted. Since we have also assumed (588) we 
may write simply 

Um = -i^iu'^^u) (659) 
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Now comparing the matrix element (659) with (617), we see that the magnetic effect of the second-order 
radiative correction to scattering is exactly described by saying that the electron has an anomalous magnetic 
moment 5M given by 

a eh 01 , r 

SM = -- =+— M 660) 

Aw mc 2-K 

This is the famous Schwingcr correction to the electron magnetic moment, which wc have now calculated. 
Not only for scattering but for all phenomena in the non-relativistic range, the magnetic part of the second- 
order radiative correction to the motion of an electron is equivalent simply to the anomalous magnetic 
moment (660). 

This anomalous moment has been extremely accurately confirmed experimentally by*® Kusch, Prodell 
and Koenig {Phys. Rev. 83 (1951) 687) who find 

^ = 0.001145 ±0.000013 
M 

The calculated value including a fourth-order correction found by Karplus and Kroll {Phys. Rev. 77 
(1950) 536) is 

— = — - 2.973 {^\= 0.0011454 
M 27r V y 



B. Relativistic Calculation of the Lamb Shift 

To make a correct relativistic calculation of the Lamb shift we have to repeat the treatment of line-shifts 
and line-widths which we gave earlier, only now using the relativistic theory for the atom. So we should 
set up the equation of motion of atom plus radiation field in the Bound Interaction Representation. The 
equation of motion is then given by (245), (247), only with the operator now describing the system of a 
relativistic atom. The solution of (245) can then be found as in the non-relativistic case, using the known 
wave-functions of the stationary states of the atom. In this way the Lamb shift was actually calculated 
by Lamb and KroU*^, Phys. Rev. 75 (1949) 388. However, in their calculations Lamb and Kroll had 
troubles with the subtraction of the divergent mass-renormalization effect. Because the calculation was all 
in terms of the atomic wave-functions, it was not possible to keep using relativistically invariant notations 
all the way through. Hence the mass term could not be clearly separated from the remaining finite terms 
by its dependence on the particle momentum p, as it was separated for example in Eq. (566) during the 
calculation of radiative corrections to scattering. The final result of Lamb and Kroll was uncertain because 
of this difficulty in the mass separation. They obtained the correct answer 1052 Mc but only by making 
use of the experimentally measured value a/27r of the electron anomalous magnetic moment.*^ 

Learning from the calculation of the radiative corrections to scattering, we see that to make a clear 
separation of the mass renormalization from observable effects, we must arrange the calculations so that 
the separation is done for a particle of momentum p in a variable Lorentz system. Then by varying the 
Lorentz system we can vary p and identify the mass term unambiguously as the expression which has the 
correct dependence on p. To work in a variable Lorentz system, we must work in a representation which 
is independent of the Lorentz system, so that the calculations are formally invariant. The only convenient 
invariant representation is the Free Interaction Representation. 

Hence our program is to set up the equation of motion first in the Free Interaction Representation, then 
carry through a relativistically invariant calculation to identify and cancel the divergent renormalization 
effects unambiguously. After this we must transform to the Bound Interaction Representation for the final 
calculation of the line-shift. This two-stage calculation is absolutely necessary in order to get the right 
answers. It was Schwinger's invention. 

In the FIR the equation of motion is 



ih^ = {H%t)+H'{t))^ 



(661) 
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with given by (532) including the mass-rcnormalization term Hg- The first stage in the solution of 
(661) is to write 

*(t) = Qi$(t) (662) 

where Vti(t) is defined as in (492), only with eipjitjj replaced by [eij^Ji.ijj^iSmc^ijj'ijj], and the function gA{t) 
is as before supposed to tend to the limiting form gA{t) ^ 1 at the end of the calculation. The operator 
f2i(f) satisfies 

iti^^^p- = {t) 1 (i) (663) 

for any value of t not in the remote past, so that we may take gA{t) = 1- Hence the equation of motion for 
$(t) derived from (661) is just 

= (664) 

Ht (t) = {fliit))''^ (t) Oi (i) (665) 

The covariant part of the calculation, which is done in the FIR, is just the evaluation of this transformed 
Hamiltonian HT{t). 

Covariant Part of the Calculation 
Let us write 

HF{t) = 02(t) H^{t) ni{t) (666) 

with f22 given by (490). Then 

Hrit) = in2{t)ni{t))-^ Hpit) = S-^Hpit) (667) 

where S, given by (495), is the scattering matrix defined in the absence of the external potential A^. We 
shall now restrict our attention to systems in which only one electron is actually present. We saw by Eq. 
(533) that S applied to a one-clcctron state is equivalent to the unit operator, i. e.g. S produces no real 
scattering or shift of phase in one-particle states. Therefore in discussing the hydrogen atom we may simply 
omit the in (667) and write 

Hrit) = Hpit) (668) 

Now Hp{t) is just the term involving H^{t) in the series expansion (421). In fact we have, by direct 
multiplication of the series for ili and 

Hrit) = i^Yj ^J---Jdtidt2...dtnP {H'{t), H\tt), H\tn)} (669) 

the damping functions g(ti) being always understood when they are not written explicitly. Now (667) is 
directly related to the operator U given by (535), namely 

i r°° 

U = -- HF{t)dt (670) 

We write 

Hrit) = H%t) + Hriit) + HT2{t) (671) 

expanding Ht in powers of the radiation interaction just as U was expanded in (536)-(540). But the matrix 

elements of (t/2 + C^2 ) referring to one-clcctron transition have already been calculated and arc given by 
(617) assuming non-relativistic velocities for the electron. This enables us to write down immediately a 
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formula for the operator Ht2, valid for one-electron transitions at non-relativistic velocities. In (617) each 
factor q\ may be replaced by {—id/dx\) operating on the potentials (534). Then (617) becomes 



An mc^ J ^ dx\ 



And using (670) and (668) we deduce 

ie a 



Ht2^^^ {log ii + ii - 1} 1 i^inUmx) d^x 



47r mc J ^ ox\ 



(673) 



+ - I ^ 2^ —'yxJp(x)d''x 

A 



Specializing now to the case of a time-indcpcndcnt electrostatic potential given by 

Ai = i(p{r), V = -e(p, (674) 

H%t) = j V{r){ip*tp)d^r, (675) 

we find^^ 

"-^-■^.['•*Tr^Vi-\]ji^'yW*)^'r ^^^^^ 



47r/i J 



The calculation of Ht2 , which is the main part of the Lamb shift calculation, can thus be taken over directly 
from the scattering calculation. In particular, the elimination of divergent renormalization effects does not 
have to be considered afresh. Once (G76) has been derived, everything is finite and we are permitted to 
carry out the rest of the calculation in a non-covariant way. Note however that the low-energy photon 
frequency cut-off r still appears in (676). We expect that this dependence on r will finally disappear when 
the effects of Hti are considered, the same cut-off being used in both Hti and Hj'2- 

We next turn to the evaluation of Hti, which is related to Ui in the same way as Ht2 to {U2 + U2)- 
According to (484) the matrix element of f/i for a one-electron transition between the states (542), with 
emission of the photon (440), is ^° 

where now 

= e^{p' + k' - p) (677) 

is given by the Fourier expansion (534). Note that we cannot yet use the simple form (546) for Mi, because 
we do not know that k' <^ p,q for the photons which will be important in this problem. Using the Dirac 
equation satisfied by u and u', we may write without approximations using (587) and rule 4 on page 87, 

2h^c^ I p-k' p' ■ k' I 

Since p and p' are assumed non-relativistic we may write 

p- k' = p' ■ k' = —jJLk'Q 
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and then 

2 

Ml = 2n^^5^ u' {2 ((y -p)-e')^ + i't'i + u (678) 

Now because we shall be considering only an electrostatic potential (674), ^ is a multiple of 74 simply. Then 
if k'^ is the space- like part of the vector k', we have 

Therefore 

0'^^ + 0^^' = ik'^^' {2^4) (679) 

Now this term (679) is small compared to the other term in (678), because ^' involves the matrices 71, 72, 73 
while ^ involves 74, and the matrix elements of 71,72,73 for non-relativistic transitions are small, of the 
order of {v/c). The term (679) in fact describes magnetic radiation, whereas the other term in (678) gives 
electric. The electric term gives an effect of the order of the Lamb shift. Hence in our approximation we 
may neglect the magnetic term and write 

which is the same result as we should have obtained from (546). 

Let Za{x) be the Hertzian vector corresponding to the radiation field potentials Ax{x), defined by 

Ax{x) = j^Zxix) (681) 
Then the matrix element of Z\{x) for emitting the photon with potentials (440) is 

Zx{x) = ^ele-^'='- (682) 

Compare with (422)^^. 

Thus the operator Ui which has the matrix element (680) may be written^^ 



Ui = I dxi){Z ■ d)4 i'{x) (683) 



Using (670) and specializing by means of (674), this gives 



e 



Hti = — I i^*{Z ■ dV)7pd-'r (684) 



This completes the evaluation of Ht- 



Discussion and the Nature of the ^-Representation 

To understand the effect of the transformation (662), we observe that if ^'(i) is the state of one real electron 
in the absence of an external field, then $(i) will be independent of t and will represent one "bare" electron 
with the same momentum as the real electron. In an actual hydrogen atom, we may consider the state 
^{t) to a very good approximation as a superposition of states of a single real free electron; then <I>(t) is 
a superposition of states of a single bare electron with the same distribution of momenta. Thus by the 
transformation from to $ we have eliminated the radiation field surrounding the electron, all remaining 
efl[ects of this radiation field being contained in the operator Ht- 

It is essential at this point to make sure that in the $ representation the field-operators are still free- 
particle operators, with the correct equations of motion for operators in the FIR. Thus the transformation 
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(662) is only a transformation from one set of variables to another within the FIR, and does not take us 
out of the FIR. This point was never explained properly by Schwinger in his papers, although he no doubt 
understood it himself. 

Let then Q{x) be a field-operator of the representation. Being a FIR operator, Q{x) satisfies 

ih'^ = [Q,Ho] (685) 

where Hq is the Hamiltonian of the Dirac and Maxwell fields without interaction. In the $ representation 
the corresponding field-operator is 

Q'{x) = (fl^it))-^ Q{x) Cli{t) (686) 

Now rii(t) is given by (492) where we now take the limit gA = 1- The operators appearing in (492) are all 
FIR operators satisfying equations of motion of the form (685). When the integrations in (492) are carried 
out, the integrated terms will still have the same time variation given by (685), except for those terms 
which correspond to transitions in which energy is conserved. The energy-conserving matrix elements will 
have an explicit linear dependence on t after integration, which is not in accordance with (685). Therefore 
we conclude that the equation of motion 

ih^ = [ni,Ho] (687) 

is valid for all matrix elements of Qi which are not diagonal in the FIR. The same equation of motion 
is satisfied by {Qi{t)) ^ with the same condition. Now we have seen that ^i{t) has no matrix elements 
diagonal in Hq which give transitions cither from or into onc-particlo states. Hence (687) holds for all 
matrix elements in which either initial or final state is a one-particle state. 

Combining (687) and the corresponding equation for (fii)"^ with (685) and (686), we have 

ih^ = [Q',Ho] (688) 

This equation (688) is valid for all matrix elements between one-particle states. Thus we can conclude that 
so long as we confine attention to a onc-clcctron system, (688) is satisfied by all field operators of the $ 
representation, and so the $ representation is still within the FIR. 

When we consider systems containing more than one particle, then Oi (t) will have an explicit dependence 
on time in addition to (687). Then the $ representation would no longer belong to the FIR. And this is 
physically reasonable, because in many-electron systems it is not in general possible to transform away the 
radiation interaction completely, the radiation interaction by itself giving rise to real effects such as M0ller 
scattering which we should not wish to transform away. 



Concluding Non-Covariant Part of the Calculation 

Having established that the ^-representation in which (664) holds is the FIR, we now proceed at once to 
transform to the BIR in which we shall finish the calculation of the Lamb shift. To transform to the BIR, 
we write 

m) = e'^ot/fte-'{^^o-Fi?=}t/ft (689) 
The new wave- function $'(t) thus satisfies 

ih— = {Hriit) + HT2{t)} (690) 

where Hti and IIt2 are given by (684) and (676), only now the tp* and ip operators have the time- variation 
of the Dirac field in the external potential V. 
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To solve (690) we may now use precisely the method that we used for solving (245) in the non-relativistic 
treatment. There are only two differences, (i) we have now the extra term Ht2, and (ii) the form of Hti 

is different form (247). 

Since we are working only to second order in the radiation interaction and Ht2 is already of that order, 
Ht2 is to be treated only as a first-order perturbation. Then IIt2 will have no effect on the line-width F, 
and will contribute to the line-shift A£J just the expectation-value of (676) in the state Vo of the atom, 
namely 

AE2 = ^{log|: + ii-i} y'(vV)|V^opdV-^y'^o*74(«-VF)V'od^r (691) 

The effect of the change from (247) to (684) is that the matrix element (n m) given by (256) now 
becomes everywhere replaced by 

We see this by comparing (247) and (684), noting that 

6C f 

j(.r,t) ^ —tjj*'d)VV / dt 

hjjL J 



or 



J hn\k\c J 



In the previous calculation we used a non-relativistic dipole approximation for which gave according to 
(272) 

We shall again use a dipole approximation and drop the exponential factor in (692). Then taking for the 
atom the non-relativistic Hamiltonian ^ 

H=^+V (694) 
2m 

the difference between (692) and (693) becomes 

J^in m) - jl{n m) = j^^^j V^^rf'Vj (£„ -E,,- hc\k\) (695) 

where we used [p^,PiJi\ = and / Puip^p^tpmd^i' = — J tpuPtiP'^'^Prnd^r. This difference vanishes for 
transitions in which energy is conserved. Hence the value of P given by (262) is unaffected by the change 
from j to J. The value F calculated previously is still valid in the relativistic theory, except for very small 
effects from magnetic radiation which we have neglected. 

Using (695) we have the simple relation between and J^, 

4{nm)=j';inm)^^^^^ (696) 

Using formula (261) with J substituted for j, the contribution from Hti to the line-shift becomes instead 
of (273) 

AF = r^S- {En-E,y\p^o\^ 

' en^m'^hc^ k ^ En - Eo + hc\k\ ^ ' 

The integral is now convergent at high frequencies and only divergent at low frequencies where the cut-off 
r is now required to make it finite. The shift (697) would be zero for a free particle, and so there is no 
question of subtracting away a mass-renormalization term as we did from (273); in the relativistic treatment 
the mass-subtraction was already done long before this stage of the calculation was reached. 
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Integrating (697) directly over A, and taking r to be small compared with {En — Eq), we find 

^E. = -^-^^(E,-E„fl,^flo,^.S^ (698) 

n 

This is precisely the non-relativistic line-shift (278) with r substituted for K. Defining {E — -Eo)av by (279) 
and using (281), we have 



This combines with (691) to give for the total level shift 

= 3^? {'»^ 2^:: + - i} /(V^ V) <,3r - ^ / • VV)*, (700) 

a result which is completely divergence-free and independent of r. 

The second term of (700) represents the effect of the anomalous magnetic moment of the electron on 
the energy levels. It therefore gives a spin-dependent shift which modifies slightly the fine-structure which 
arises from the Dirac magnetic moment. To evaluate this term we use the Dirac equations (see (38)). 

mc^ji'ipo = {Eq - y)V'o + ific{a. • V)V'o (701) 

TOcVo74 = V'o {Eq -V)- ihc{W% • a) (702) 

Using both (701) and (702) in turn in the second term of (700) and adding the results using 0*74-1-740;* = 
the terms in {Eq — V) cancel and we find^^ 

2mc^ j V'o 74(0; • W)V'o d^r 

= -ihc j {(Wo • a)(a • VV)tpo + %{cx • WV){a • Wo)} d^r 

= -ihc y {(vV'o • o-)(o- • vy)V'o + V'o(o- • vy)(<7 • vv'o)} rfV (703) 

= -ihc j {Vo [VV'o -VV + ia- (Wo X VV)] + [Wo -VV + ia- (Wo x VV)]} d^r 

= ihc j {-h(V2y)VoV'o - 2iV*[o- • {yV X V)]V'o} d^r 

Here we used a' = ecTj (see p. 20), = I, and the formula 

{a- ■ B){a ■C) = {B-C)+i{cr-BxC) 

See Dirac, The Principles of Quantum Mechanics, third edition, p. 263. 
Now suppose y is a central potential, a function of r only. Then 

VyxV = i^(rxV) = i^fix') (704) 
r dr r dr \h J 

with L, the orbital angular momentum, is given by (39). In this case (700) becomes 

^2 



AlTll^h 



° /«(;f)«'-^)*<'''- 



(705) 
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In the non-relativistic theory of the hydrogen atom, the quantum number j given by (72) is related to the 
operator (cr • L) by 

Therefore for the hydrogen atom 

where g is the coefficient (706). 

For s-states we have q = Q and so the shift reduces to (see (284)) 

3 1 r »v,^2 



ia-" 1 



for the state with principal quantum number n. For all other states the term in |i/)o(0)P is zero, and the 
shift depends only on the integral 

^)=j ^,\M'd'r (709) 
The value of (709) is given by Bethe, Handbuch der Physik, Vol. 24/1, p. 286, Eq. (3.26) [20]. It is 



'•V £(€+i)(£+l)n3a3 
where Oq is the Bohr radius of the hydrogen atom. Hence the shift for states with £ becomes 

For the relative displacement of the 2s and 2pi/2 levels, which in the Dirac theory were degenerate, we have 
finally by subtracting (712) from (708) 

AE=^Ry {log ^^^^^^ +l-h + k} = 1051 Megacycles (713) 



Accuracy of the Lamb Shift Calculation 

With the relativistic calculation of the Lamb shift which we have done, this course comes to an end. In this 
calculation we have met and seen how to overcome all the problems of mass and charge renormalization. We 
can say we now have a workable quantum electrodynamics which will give finite and unambiguous values 
for all observable quantities. 

This calculation of the Lamb shift was of course not exact. The two most important errors were 

(i) using non-relativistic wave-functions and the dipole radiation approximation in evaluating the effects 
of Hti; 

(ii) neglecting the finite mass of the proton. 

To correct these errors, very long calculations have been done. In connection with (i), Baranger [21] has 
calculated the effect of using relativistic theory in the treatment of Hti and he finds the observed shift 
increases by 7 megacycles. The effects of (ii) are being looked at by Salpeter [22] but are not greater than 
1-2 megacycles at most. In addition we have not considered 
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(iii) effects of fourth order in the radiation interaction. These are being looked at by Kroh and others 
[23]; they are certainly less than 1 megacycle. [24] 

Therefore the theoretical value of the Lamb shift now stands at 1058 ± 2 megacycles. There is no clear 
discrepancy between this and the experimental value 1062 ± 5 though a discrepancy may be found when 
the experiments and the theory are further cleaned up. 



The End 



Typist's Afterword 



Both Kaiser's admirable Drawing Theories Apart [8] and Schwcber's masterful QED and the Men Who 
Made It [7] refer frequently to the famous lectures on quantum electrodynamics given by Freeman Dyson at 
Cornell University in 1951. Two generations ago, graduate students (and their professors) wishing to learn 
the new techniques of QED passed around copies of Dyson's Cornell lecture notes, then the best and fullest 
treatment available. Textbooks appeared a few years later, e. g. by Jauch & Rohrlich [25] and Schweber 
[6], but interest in Dyson's notes has never fallen to zero. Here is what the noted theorist E. T. Jaynes 
wrote in an unpublished article [26] on Dyson's autobiographical Disturbing the Universe, 1984: 

But Dyson's 1951 Cornell course notes on Quantum Electrodynamics were the original basis of 
the teaching I have done since. For a generation of physicists they were the happy medium: 
clearer and better motivated than Feynman, and getting to the point faster than Schwinger. 
All the textbooks that have appeared since have not made them obsolete. Of course, this is to 
be expected since Dyson is probably, to this day, best known among the physicists as the man 
who first explained the unity of the Schwinger and Feynman approaches. 

As a graduate student in Nicholas Kemmer's department of theoretical physics (Edinburgh, Scotland) I 
had heard vaguely about Dyson's lectures (either from Kemmer or from my advisor, Peter Higgs) and had 
read his classic papers [27], [28] in Schwingor's collection [4]. It never occurred to me to ask Kemmer for a 
copy of Dyson's lectures which he almost certainly had. 

My interest in the legendary notes was revived thirty years later by the Kaiser and Schweber books. 
Within a few minutes Google led to scans of the notes [29] at the Dibner Archive (History of Recent 
Science & Technology) at MIT, maintained by Karl Hall, a historian at the Central European University 
in Budapest, Hungary. He had gotten permission from Dyson to post scanned images of the Cornell notes. 
Through the efforts of Hall, Schweber and Babak Ashrafi these were uploaded to the Dibner Archive. To 
obtain a paper copy would require downloading almost two hundred images, expensive in time and storage. 
Was there a text version? Had anyone had retyped the notes? Hall did not know, nor did further searching 
turn anything up. I volunteered to do the job. Hall thought this a worthwhile project, as did Dyson, who 
sent me a copy of the second edition, edited by Michael J. Moravcsik. (This copy had originally belonged 
to Sam Schweber.) Dyson suggested that the second edition be retyped, not the first. Nearly all of the 
differences between the two editions are Moravcsik's glosses on many calculations; there is essentially no 
difference in text, and {modulo typos) all the labeled equations are identical. 

Between this typed version and Moravcsik's second edition there are few differences; all are described in 
the added notes. (I have also added references and an index.) About half are corrections of typographical 
errors. Missing words or sentences have been restored by comparison with the first edition; very infrequently 
a word or phrase has been deleted. A few changes have been made in notation. Intermediate steps in two 
calculations have been corrected but change nothing. Some notes point to articles or books. No doubt new 
errors have been introduced. Corrections will be welcomed! The young physicists will want familiar terms 
and notation, occasionally changed from 1951; the historians want no alterations. It was not easy to find 
the middle ground. 

I scarcely knew before beginning this project. My friend (and Princeton '74 classmate) Robert 

Jantzen was enormously helpful, very generous with his time and his extensive knowledge ofWT^. Thanks, 
Bob. Thanks, too, to Richard Koch, Gerben Wierda and their colleagues, who have made I^T^iX so easy on 



132 



133 



a Macintosh. George Gratzer's textbook Math into l^Tjipi was never far from the keyboard. No one who 
types technical material should be ignorant of I^T^jX. 

This project would never have been undertaken without the approval of Prof. Dyson and the efforts 
of Profs. Hall, Schweber and Ashrafi., who made the notes accessible. I thank Prof. Hall for his steady 
encouragement through the many hours of typing. I thank Prof. Dyson both for friendly assistance and 
for allowing his wonderful lectures to become easier to obtain, to be read with pleasure and with profit for 
many years to come. 

David Derbes 
Laboratory Schools 
University of Chicago 
loki@uchicago.edu 
11 July 2006 



Notes 



"is" substituted for "in" 

^ Both the first and the second editions use A for the Laplacian differential operator; as there are many 
quantities with A in them to denote a change or shift, the use of for the Laplacian seems a safer choice. 

^ The text symbols "div", "grad" and "curl" have been replaced throughout by "V •", "V", and "Vx", 
respectively. For example, the equation (3) originally read 

^ "differential" replaces "diff." 

^ In the second edition, this equation lacked a label. 

^ In the first edition, the coefficient of a • V is +ihc 

E lacked the exponent 2 in Eq. (47). 
* Cs lacked a subscript s 

® The quantity ^(mc^ — E)^ — {rri^c^ — -E^)^ lacked an exponent 2. 

The spur of a matrix A is the sum of its diagonal elements, and denoted Spur A or Sp A. This term 
is also, and more commonly, known as the trace of a matrix, denoted Tr A. 
"ever" was written for "even" 

This equation differs from the form given in the first edition by a factor of — ;|, but this form also 
seems incorrect; perhaps the correct relation is 

ri 



V{r) = —An / — dri / r2^p{r2) dr2 
Jo ^1 ^0 



In the second edition, the factor "dEi" was missing 

In the literature, the gauge condition V • A = is now called "Coulomb gauge" ; the choice of the 
gauge condition df^A^ — (using the Einstein summation convention) is called "Lorentz gauge" . (See also 
Eq. (588).) In the first edition, Dyson uses Einstein's convention; in the second edition, Moravcsik does 
not. Sec also the parenthetical remark following Eq. (234a). 

Rewritten. In v.l, Dyson writes "The factor P1E2 — P2E1 is invariant for a Lorentz transformation 
parallel to the 3 axis." In v. 2, Moravcsik writes "It is worth noting that the factor P1E2 —P2E1 is invariant 
under Lorentz transformations leaving the xi and X2 components unchanged." 

These three articles may be found in Schwinger, Selected Papers on Quantum Electrodynamics. 

Deleted "for"; the original statement read "condition that for its SI{n) = 0" 

Eq. (185) lacked a label in v. 2. The discussion beginning at Eq. (182) and continuing to Eq. (186) 
is unusually different between the editions. What is here follows Moravcsik's v. 2 with the addition of the 
phrase "the matrix element" at Eq. (185). 

A 5 was missing: the equation read ihO = [51{^),0{a)]. 

The notation originally used for anticommutators was AB + BA = [A, 5] + .The more familiar {^1, B} 
has been used instead. 

Here, was substituted for the original <j) (the variable in Eq. (187)) for clarity. 
In the last commutator, the operator ak>x lacked the tilde. 
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"Bleuler" was written "Bleuber" . 

Originally, the phrase read = dA/dX/^; this seemed confusing as the original potential is itself A^. 

Eq. (222) lacked the lower Hmit fco > on the integral. 
26 £)W(^x) was added on the left-hand side for clarity 
2'^ Eq. (230) lacked the sum over /z; cf. Eq. (170). 

"obtained" was inserted; statement read "per unit time is using ..." 
^® The subscript A was missing in the term jxA{x'). 

The exponential in the first integral lacked i; it had the argument —k r. 
31 Bethe & Salpeter, ref. [20], p. 249, Eq. 59.7 
3^ There was no subscript "o" on the vacuum expectation value. 
33 "if" substuted for "it" ; and "gives" added to the previous sentence. 

3^ The limits on the integral were —1 and +1, and the value of the integral was given as |. 
3^ The expression for Gq is not here in the original, but it appears before Eq. (240). 
3^ The expression for p is not in the original. 
37 ao = 0.529177 x IQ-^ cm; Ry = 13.6056 eV. 

3^ "time-independent" substituted for "time-dependent" , which describes the Heisenberg representation. 
3^ Sum over /it inserted. 

Following "given by (263)", the second edition has the phrase "with o suffices". The first edition lacks 
this phrase. As the sentence makes more sense without it, it has been deleted. 
'^^ "s states" replaces "a; states" . 

The coefficient of the first integral in Eq. (283) was h; it has been replaced by h^. Also, Eq. (283) 
lacked a label in the second edition. 

"^3 Here, the Bohr radius was denoted a; it seemed reasonable to use Qq instead. 

44 iijgi) replaces "being" 

45 K^yji replaces "be" 

'^^ In V.2, the Hermitian conjugate b'^ lacked the asterisk. 

The argument of the exp function originally had the factor (t — to); this has been replaced by the factor 
(t-f). 

'^^ The function Fi,{k) was written as a scalar, F{k). This is misleading; the right-hand side is a vector 
function, because it is linear in 7^. So F{k) was promoted to F^{k). 
^® The phrase "integral in the" was inserted, 
"definite" replaces "indefinite" 
In v.2, this reads "Spur X^/^T^Ti^ = 4" 

The bottom limit of was added to the last integral sign. Note that the change of variable is easier to 
follow by first observing 

1/2 



/ dz{z-z^)f{z-z'')=2 dz{z-z^)f{z-z'') 
Jo Jo 



because the expression {z — z^) is symmetric about z = \. 

^3 The original read "Then by (390) in (396) the second term is zero" . 
"proton" replaces "photon" 

The potentials had a subscript v and the gamma matrices a subscript A. 
"from" replaces a second "in" 
^7 The factor X2 had a superfluous superscript "'" . 
A, B added for clarity 

The words after "because", "using 4) . . . (451)" were added. 
^° In the second edition, the intermediate calculation was wrong; however, the conclusion was correct. It 
was rewritten up to Eq. (464). See the trace theorems Eq. (585) et seq. Also, note the identity 



Sp ((^^(^2(^35^4) = a2)(a3- 04) - («!• a3)(a2- 04) + (ai- a4)(a2- as) 
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The third equation lacked a subscript "0" on the variable k^, the fourth equation lacked a superscript 
""' on the variable feg- 

Eq. (468) lacked a label. The word "simply" was inserted. 

"Thomson" replaces "Thompson". 

The first spinor u lacked a bar; u replaces u. 

The expression for Tq was added. 

66 In the S'^i edition of Heitler's book, see §25. 

67 In the 3'''* edition of Heitler's book, see §26. 

6^ The first potential lacked a slash; ^ replaces A. 

6^ The phrase "in which all emission operators stand to the left of all absorption operators" was lost in 
the transition from the first edition to the second. 

™ Nowadays called "normal order", this ordering arises in connection with Wick's Theorem: 

(time ordered operators) = (normal ordered operators) + (all contractions) 

the contractions being equal to the propagators Sp, Dp, and so on. 
''^ In Schwinger's QED anthology. 
"^^ In Schwinger's anthology. 

7^ Both the time ordering brackets lacked a right bracket. These were added. 

''^ Unlike Dyson, Moravcsik cited Eq. (585) as well as Eq. (376). In Eq. (585) are Dirac matrix identities 
which establish the equality between the first two integrals in Eq. (522). Logically these identities should 
have been introduced before Chapter 6, but nothing prevents a reader making use of a "forward" reference. 

''^ The subscript on the second ^ was originally "o"- It has been replaced with a subscript "d". 

^"6 Again, a subscript /it has been appended to the function F{k) to make it a Lorentz vector. See note 
31 at Eq. (371). 

'^'7 The last curly bracket was missing; it has been added. 

"^^ The times symbol x was inserted. 

''^ This sentence read formerly ". . . the factor e{q — e — e') is a continuous function of (e + e') and tends 
to /(g) ..." 

A sentence, "Here, {k-py)z denotes the scalar product of the space-like parts of the vectors k and Py^, 
was deleted, because the expression fc • p^^ is self-explanatory. In both the first and second editions, scant 
attention was paid to three-vectors; sometimes an overhead arrow was used, but these were very few. In 
this typed version care has been taken to represent three- vectors with bold type, thus: {A^, Ay,Az) = A. 

"in" inserted; the original read "lying the range" 

In the first line, twice in the fourth line, and in the first appearance in the sixth line, the expression 
lu'*^! lacked the exponent 2. These have been supplied. 
The modulus bars around p were absent, 
"slowly-varying" replaces "slowly-carrying" 
85 Yov "Lagrangian" read here "Lagrangian density" . Field theorists, by an abuse of language, often say 
the first and mean the second. 

^6 The original citation lacked Prodell's name. 
^7 In Schwinger's anthology. 

The notation "Mc" is outmoded; usually this is written "MHz." 
^® "a" is the fine-structure constant, j^f^^l "oc" is the Dirac matrix. Originally "a • V" was rendered 
as "a • grad" . 

The equation labeling was faulty. In the second edition, what is here labeled (676a) was a second 
(676), and what is here labeled (677a) had no label at all. It might have been all right to leave Eq. (677a) 
unlabeled, except that in the first edition, both Eq. (677) and Eq. (677a) are labeled (677)! This is a 
compromise. 

The comparison with Eq. (422) is not obvious. Perhaps Eq. (438) was meant? 
The integrand was originally written 

lp{Z • grad) i!'i}{x) 
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Since Zx is a Lorentz vector, the gradient must likewise be. So "grad" here must be not V. One goes 
from (682) to (680) by an integration by parts; since there are in (680) apparently the dot product of two 
4- vectors, this supports the identification here of grad = d\. 

In the second to last line, the term ia • {'V'ipo x VV) was written with a dot product between the two 
gradients, rather than a cross product. 
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